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"Expression System For Modulating an Immune Response" 

FIELD OF THE INVENTION 
[0001] The present invention relates generally to gene expression. More 
5 particularly, the present invention relates to methods for modulating the quality of an immune 
response to a target antigen in a mammal, which response results from the expression of a 
polynucleotide that encodes at least a portion of the target antigen, wherein the quality is 
modulated by replacing at least one codon of the polynucleotide with a synonymous codon 
that has a higher or lower preference of usage by the mammal to confer the immune response 
10 than the codon it replaces. Even more particularly, the present invention relates to the use of a 
protein-encoding polynucleotide whose codon composition has been modified for modulating 
the quality of an immune response to an antigen in a mammal. 

BACKGROUND OF THE INVENTION 
[0002] The expression of foreign heterologous genes in transformed cells is now 

15 commonplace. A large number of mammalian genes, including, for example, murine and 
human genes, have been successfully expressed in various host cells, including bacterial, 
yeast, insect, plant and mammalian host cells. Nevertheless, despite the burgeoning 
knowledge of expression systems and recombinant DNA technology, significant obstacles 
remain when one attempts to express a foreign or synthetic gene in a selected host cell. For 

20 example, translation of a synthetic gene, even when coupled with a strong promoter, often 
proceeds much more slowly than would be expected. The same is frequently true of 
exogenous genes that are foreign to the host cell. This lower than expected translation 
efficiency is often due to the protein coding regions of the gene having a codon usage pattern 
that does not resemble those of highly expressed genes in the host cell. It is known in this 

25 regard that codon utilization is highly biased and varies considerably in different organisms 
and that biases in codon usage can alter peptide elongation rates. It is also known that codon 
usage patterns are related to the relative abundance of tRNA isoacceptors, and that genes 
encoding proteins of high versus low abundance show differences in their codon preferences. 

[0003] The implications of codon preference phenomena on gene expression are 
30 manifest in that these phenomena can affect the translational efficiency of messenger RNA 
(mRNA). It is widely known in this regard that translation of "rare codons", for which the 
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corresponding iso-tRNA is in low abundance relative to other iso-tRNAs, may cause a 
ribosome to pause during translation which can lead to a failure to complete a nascent 
polypeptide chain and an uncoupling of transcription and translation. Thus, the expression of 
an exogenous gene may be impeded severely if a particular host cell of an organism or the 
5 organism itself has a low abundance of iso-tRNAs corresponding to one or more codons of 
the exogenous gene. Accordingly, a major aim of investigators in this field is to first ascertain 
the codon preference for particular cells in which an exogenous gene is to be expressed, and 
to subsequently alter the codon composition of that gene for optimized expression in those 
cells. 

1 0 [0004] Codon-optimization techniques are known for improving the translational 

kinetics of translationally inefficient protein coding regions. Traditionally, these techniques 
have been based on the replacement of codons that are rarely or infrequently used in the host 
cell with those that are host-preferred. Codon frequencies can be derived from literature 
sources for the highly expressed genes of many organisms (see, for example, Nakamura et al , 

15 1996, Nucleic Acids Res 24: 214-215). These frequencies are generally expressed on an 
'organism- wide average basis' as the percentage of occasions that a synonymous codon is 
used to encode a corresponding amino acid across a collection of protein-encoding genes of 
that organism, which are preferably highly expressed. 

[0005] Typically, codons are classified as: (a) "common" codons (or "preferred" 
20 codons) if their frequency of usage is above about 4/3 x the frequency of usage that would be 
expected in the absence of any bias in codon usage; (b) "rare" codons (or "non-preferred" 
codons) if their frequency of usage is below about 2/3 x the frequency of usage that would be 
expected in the absence of any bias in codon usage; and (c) "intermediate" codons (or "less 
preferred" codons) if their frequency of usage is in-between the frequency of usage of 
25 "common" codons and of "rare" codons. Since an amino acid can be encoded by 2, 3, 4 or 6 
codons, the frequency of usage of any selected codon, which would be expected in the 
absence of any bias in codon usage, will be dependent upon the number of synonymous 
codons which code for the same amino acid as the selected codon. Accordingly, for a 
particular amino acid, the frequency thresholds for classifying codons in the "common", 
30 "intermediate" and "rare" categories will be dependent upon the number of synonymous 

codons for that amino acid. Consequently, for amino acids having 6 choices of synonymous 
codon, the frequency of codon usage that would be expected in the absence of any bias in 
codon usage is 16% and thus the "common", "intermediate" and "rare" codons are defined as 
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those codons that have a frequency of usage above 20%, between 10 and 20% and below 
10%, respectively. For amino acids having 4 choices of synonymous codon, the frequency of 
codon usage that would be expected in the absence of codon usage bias is 25% and thus the 
"common", "intermediate" and "rare" codons are defined as those codons that have a 
5 frequency of usage above 33%, between 16 and 33% and below 16%, respectively. For 
isoleucine, which is the only amino acid having 3 choices of synonymous codon, the 
frequency of codon usage that would be expected in the absence of any bias in codon usage is 
33% and thus the "common", "intermediate" and "rare" codons for isoleucine are defined as 
those codons that have a frequency of usage above 45%, between 20 and 45% and below 

10 20%, respectively. For amino acids having 2 choices of synonymous codon, the frequency of 
codon usage that would be expected in the absence of codon usage bias is 50% and thus the 
"common", "intermediate" and "rare" codons are defined as those codons that have a 
frequency of usage above 60%, between 30 and 60% and below 30%, respectively. Thus, the 
categorization of codons into the "common", "intermediate" and "rare" classes (or 

15 "preferred", "less preferred" or "non preferred", respectively) has been based conventionally 
on a compilation of codon usage for an organism in general (e.g. , 'human- wide') or for a class 
of organisms in general (e.g., 'mammal- wide 9 ). For example, reference may be made to Seed 
(see U.S. Patent Serial Nos 5,786,464 and 5,795,737) who discloses preferred, less preferred 
and non-preferred codons for mammalian cells in general. However, the present inventor 

20 revealed in WO 99/02694 and in WO 00/42190 that there are substantial differences in the 

relative abundance of particular iso-tRNAs in different cells or tissues of a single multicellular 
organism (e.g. , a mammal or a plant) and that this plays a pivotal role in protein translation 
from a coding sequence with a given codon usage or composition. 

[0006] Thus, in contrast to the art-recognized presumption that different cells of a 
25 multicellular organism have the same bias in codon usage, it was revealed for the first time 
that one cell type of a multicellular organism uses codons in a manner distinct from another 
cell type of the same organism. In other words, it was discovered that different cells of an 
organism can exhibit different translational efficiencies for the same codon and that it was not 
possible to predict which codons would be preferred, less preferred or non preferred in a 
30 selected cell type. Accordingly, it was proposed that differences in codon translational 

efficiency between cell types could be exploited, together with codon composition of a gene, 
to regulate the production of a protein in, or to direct that production to, a chosen cell type. 
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[0007] Therefore, in order to optimize the expression of a protein-encoding 
polynucleotide in a particular cell type, WO 99/02694 and in WO 00/42190 teach that it is 
necessary to first determine the translational efficiency for each codon in that cell type, rather 
than to rely on codon frequencies calculated on an organism- wide average basis, and then to 
5 codon modify the polynucleotide based on that determination. 

[0008] The present inventor further disclosed in WO 2004/042059 a strategy for 
enhancing or reducing the quality of a selected phenotype that is displayed, or proposed to be 
displayed, by an organism of interest. The strategy involves codon modification of a 
polynucleotide that encodes a phenotype-associated polypeptide that either by itself, or in 

1 0 association with other molecules, in the organism of interest imparts or confers the selected 
phenotype upon the organism. Unlike previous methods, however, this strategy does not rely 
on data that provide a ranking of synonymous codons according to their preference of usage 
in an organism or class of organisms. Nor does it rely on data that provide a ranking of 
synonymous codons according to their translational efficiencies in one or more cells of the 

15 organism or class of organisms. Instead, it relies on ranking individual synonymous codons 
that code for an amino acid in the phenotype-associated polypeptide according to their 
preference of usage by the organism or class of organisms, or by a part thereof, for producing 
the selected phenotype. 
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SUMMARY OF THE INVENTION 
[0009] The present invention is predicated in part on the experimental 
detennination of a ranking of individual synonymous codons according to their preference for 
producing an immune response, including a humoral immune response, to an antigen in a 
5 mammal. Significantly, this ranking is not coterminous with a ranking of codon frequency 

values derivable from an analysis of the frequency with which codons are used to encode their 
corresponding amino acids across a collection of highly expressed mammalian protein- 
encoding genes, as for example disclosed by Seed (supra). Nor is it coterminous with a 
ranking of translational efficiency values obtained from an analysis of the translational 

10 efficiencies of codons in specific cell types, as disclosed for example in WO 99/02694 for 
COS-1 cells and epithelial cells and in WO 2004/024915 for CHO cells. Indeed, the present 
inventors have determined that codon modification of wild-type antigen-encoding 
polynucleotides to replace codons found in the wild-type sequence with codons having a 
higher preference for producing an immune response than the codons they replaced 

15 significantly enhances the immune response to the encoded antigen, as compared to the 
immune response obtained with the wild-type sequence. As a result, the present invention 
enables for the first time the construction of antigen-encoding polynucleotides, which are 
codon-optimized for efficient production of immune responses in a mammal. 

[0010] Thus, in one aspect of the present invention, methods are provided for 
20 constructing a synthetic polynucleotide from which a polypeptide is producible to confer an 
immune response to a target antigen in a mammal in a different quality than that conferred by 
a parent polynucleotide that encodes the same polypeptide, wherein the polypeptide 
corresponds to at least a portion of the target antigen. These methods generally comprise: (a) 
selecting a first codon of the parent polynucleotide for replacement with a synonymous 
25 codon, wherein the synonymous codon is selected on the basis that it exhibits a different 

preference for conferring an immune response ("an immune response preference") than the 
first codon in a comparison of immune response preferences; and (b) replacing the first codon 
with the synonymous codon to construct the synthetic polynucleotide, wherein the 
comparison of immune response preferences of the codons is represented by TABLE 1 : 
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TABLE 1 



Amino Acid 


Ranking of Immune Response Preferences for Synonymous 

Codons 


Ala 


Ala GCT > Ala GCC > (Ala GCA , Ala GCG ) 


Arg 


(Arg CGA , Arg CGC , Arg CGT , Arg AGA ) > (Arg AGG Arg CGG ) 


1 ^sn 


Asn AAC >Asn AAT 


Asp 


Asp GAC >Asp GAT 


Cys 


Cys TGC >Cys TGT 


Glu 


Glu GAA > Glu GAG 


Gin 


Gln CAA = Gln CAG 


Gly 


Gly GGA > (Gly GGG , Gly GGT , Gly GGC ) 


His 


His CAC = His CAT 


He 


Ile ATC » Ile ATT >Ile ATA 


Leu 


(Leu CTG Leu CTC ) > (Leu CTA , Leu CTT ) » Leu TTG > Leu™ 


I Lys 


Lys^ 0 = Lys^ 


Phe 


Phe 777 > Phe TTC 


Pro 


Pro ccc > Pro CCT » (Pro CCA , Pro CCG ) 


Ser 


Ser TCG » (Ser TCT , Ser TCA , Ser TCC ) » (Ser AGC , Ser AGT ) 


Thr 


A CCl . . .. a CC .. AC A - - APT 
■pj^ACG > Thr ACC >;> Tm .ACA > J hr ACI 


Tyr 


Tyr TAC >Tyr TAT 


Val 


(Val GTG , Val GTC ) > Val Grr > Val GTA 







[0011] Thus, a stronger or enhanced immune response to the target antigen (e.g. , an 
immune response that is at least about 110%, 150%, 200%, 300%, 400%, 500%, 600%, 
700%, 800%, 900%o, 1000% and all integer percentages in between, of that produced from the 
parent polynucleotide under identical conditions) can be achieved by selecting a synonymous 
codon that has a higher immune response preference than the first codon it replaces. In 
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specific embodiments, the synonymous codon is selected such that it has a higher immune 
response preference that is at least about 10% (and at least about 1 1% to at least about 1 000% 
and all integer percentages in between) higher than the immune response preference of the 
codon it replaces. In illustrative examples of this type, the first and synonymous codons are 
selected from TABLE 2: 



TABLE 2 



First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Ala 000 


Ala GCT 


Ile ATA 


IIe A7c 


Ser^ 7 


Ser 7CG 


Ala 000 


Ala 000 


Ile ATA 


Ile A7T 


Ser AGT 


Ser 7CT 


Ala GCA 


Ala 001 " 


Ile ATT 


Ile A7C 


Ser AGT 


Ser 7CA 


Ala 00 * 


Ala 000 






Ser AGT 


Ser 7CC 


Ala 000 


Ala 007 


Leu™ 


Leu C7G 


Ser AGC 


Ser 7CG 






Lou™ 


Leu C7C 


Ser^ 0 


Ser TCT 


Arg CGG 


Arg CGA 


Leu™ 


Leu C7A 


Ser AGC 


Ser 7CA 


Arg CGG 


Arg CGC 


Leu™ 


Leu C7T 1 


Ser AGC 


Ser 7CC 


Arg CGG 


Arg CGT 


Leu™ 


Leu 770 


Ser 7cc 


Ser TCG 


Arg CGG 


Arg AGA 


Leu" 0 


Leu C7G 


Ser 7CA 


Ser 7CG 


Arg AGG 


Arg CGA 


Leu 770 


Leu C7C 


Ser 7CT 


Ser 7CG 


Arg AGG 


Arg CGC 


Leu 170 


Leu C7A 






Arg AGG 


Arg CGT 


Leu 770 


Leu CTT 


Thr Acx 


Thr ACG 


Arg AGG 


Arg AGA 


Leu C7T 


Leu C7G 


Thr ACT 


Thr ACc 






Leu C7T 


Leu C7C 


Thr ACT 


Thr ACA 


Asn^ 7 


Asn AAC 


Leu C7A 


Leu CTG 


Thr ACA 


Thr ACG | 






Leu c7A 


Leu C7C 


Thr ACA 


Thr ACC 


Asp GAT 


Asp GAC 






Thr ACC 


Thr ACG 






Phe 770 


Phe 777 
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First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Cys TGT 


Cys TGC 






__ TAT 

Tyr TAT 


_ TAP 

Tyr TAC 






Pro CCG 


Pro ccc 






Glu GAG 


Glu GAA 


Pro CCCl 


PPT 

Pro CCT 


- , ,GTA 


^ r i GTG 






Pro CCA 


^ PPP 


Val° 1A 


Val° lc 


Gly 


Gly 


Pro CCA 


„ PPT 


^ t iPrTA 

Val orA 


w y ,PfTT 

Val°" 


Gly 


GGA 


p CCT 

Fro 


riO 


V ell 


Val GTG 

V dl 


GIy GGG 


Gly GGA 






Val GTT 


Val GTC 















[0012] In other illustrative examples of this type, the first and synonymous codons 
are selected from TABLE 3 : 

TABLE 3 



First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Ala 4 ** 3 


Ala GCT 


Leu™ 


Leu CTA 


Ser AGT 


Ser TCG 


Ala GCA 


Ala GCT 


Leu™ 


Leu CTT 


Ser AGT 


Ser TCT 


Ala 000 


Ala GCT 


Leu™ 


Leu TTG 


Ser^ 7 


Ser TCA 






Leu" 0 


Leu CTA 


Ser AGC 


Ser TCG 


Arg CGG 


Arg CGA 


Leu 770 


Leu CTT 


Ser AGC 


Ser TCT 


Arg CGG 


Arg CGT 






Ser AGC 


Ser TCA 


Arg CGG 


Arg AGA 


Phe™ 


Phe™ 


Ser AGC 


Ser TCC 


Arg AGG 


Arg CGA 






Ser TCC 


Ser TCG 


Arg AGG 


Arg CGT 


Pro CCG 


Pro CCT 


Ser TCA 


Ser TCG 


Arg AGG 


Arg AGA 


Pro CCA 


Pro CCT 


Ser TCT 


Ser TCG 
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First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon | 


First Codon 


Synonymous 
Codon 


Glu GAG 


Glu GAA 






mi ACT 

Thr 


»Tni ACG 

Thr 










mi ACT 

Thr 


™ ACA 

Thr 


Gly GGC 


Gly GGA 






mi ACA 

Thr 


mi ACG 

Thr 


Gly 007 


Gly 






mi ACC 

Thr 


mi ACG 

Thr 




vjiy 


















Val GTA 


Val GTT 















[0013] Suitably, in some of the illustrative examples noted above, the method 
further comprises selecting a second codon of the parent polynucleotide for replacement with 
a synonymous codon, wherein the synonymous codon is selected on the basis that it exhibits a 
higher immune response preference than the second codon in a comparison of immune 
response preferences; and (b) replacing the second codon with the synonymous codon, 
wherein the comparison of immune response preferences of the codons is represented by 
TABLE 4: 



TABLE 4 



Second 
Codon 


Synonymous 
Codon ' 


Second 
Codon 


Synonymous 
Codon 


Second 
Codon 


Synonymous 
Codon 


Ala GCG 


Ala 007 


Ile ATA 


Ile ATC 


Ser AGT 


Ser TCG 


Ala 000 


Ala 000 


Ile ATA 


Ile ATT 


Ser AGT 


Ser TCT 


Ala GCA 


Ala GCT 


Ile ATT 


Ile ATC 


Ser AGT 


Ser TCA 


Ala GCA 


Ala GCC 






Ser AGT 


Ser TCC 


Ala GCC 


Ala GCT 


Leu™ 


Leu CTG 


So* 00 


Ser TCG 






Leu™ 


Leu CTC 


Ser AGC 


Ser TCT 


Arg CGG 


Arg CGA 


Leu™ 


Leu CTA 


Ser AGC 


Ser TCA 


Arg CGG 


Arg CGC 


Leu™ 


Leu CTT 


Ser AGC 


Ser TCC 
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Second 
Codon 


Synonymous 
Codon j 


Second 
Codon 


Synonymous 
Codon | 


Second 
Codon 


Synonymous 
Codon 


nnn 

Arg CGG 


nnT 

Arg 


_ TTA 

Leu TTA 


TTn 

Leu 770 


TCC 

Ser TCC 


_ TCG 

Ser ICG 


cnn 

Arg 


AfiA 

Arg 


TTG 

Leu 1 10 


Leu cl ° 


<-i TCA 

Ser 1CA 


01 TCG 

Ser ! ^ u 


AGG 

Arg AO ° 


CGA 

Arg COA 


TTG 

Leu"° 


x CTC 


0. TCT 


^ TCG 

Ser ^ U 


Ann 

Arg 


Arg CGC 


» TTG 








Ann 

Arg AGG 


nnT 

Arg 


Leu TTO 


_ pTT 

Leu c " 


ACT 

Thr ACI 


-T-^t ACG 

Thr 


Ann 

Arg AGG 


An a 

Arg 


_ CTT 


t TTG 

Leu clu j 


ml ACT 

Thr 


•n ACC 

Thr 








_ PTP 


_ ACT 


ACA 

Thr 


A AT 

Asn^ 7 


A AC 

Asn^ 0 


Leu CTA 


_ CTG 

Leu cl ° 


ACA 

Thr 


r-M-^1 ACG 

Thr 






Leu c,A 


t CTC 


T^l ACA 

Thr 


-r-i ACC 

Thr 


Asp GAT 


n ap 

Asp GAC 






ACC 

Thr 


t>i_ ACG 

Thr 






„ TTp 

Phe 770 


„ , TTT 

Phe 1M 






Cys TGT 


Cys TGC 






rx^ TAT 

Tyr 


m TAC 

Tyr 






ppp. 

Pro CCG 


„ ccc 






Glu GAG 


f"l A A 

Glu GAA 


ppp, 

Pro CCG 


„ CCT 


xr iGTA 

Val 


Val 






fpA 

Pro CCA 


Pro ccc 


XT' iGTA 

Val 


at iGTC 

Val 


Gly GGC 


nn a 

Gly GGA 


_ ppA 

Pro CLA 


CCT 

Pro 


t 7 iGTA 

Val 


Val 


ru, GCT 
yjly 


vjiy 


Pro CCT 


Pro ccc 


Val GTT 


Val G7G j 


Gly GGG 


Gly GGA 






Val GTT 


Val G7C 















[0014] Conversely, a weaker or reduced immune response to the target antigen 
(e.g., an immune response that is at less than about 90%, 80%, 70%, 60%, 50%, 40%, 30%, 
20%, 10%, 1% and all integer percentages in between, of that produced from the parent 
polynucleotide under identical conditions) can be achieved by selecting a synonymous codon 
that has a lower immune response preference than the first codon it replaces. In specific 
embodiments of this type, the synonymous codon is selected such that it has an immune 
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response preference that is less than about 90% of the immune response preference of the 
codon it replaces. In illustrative examples, the first and synonymous codons are selected from 
the TABLE 5: 

TABLE 5 



First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Ala 001 " 


Ala 000 


Ile ATC 


Ile ATA 


Ser TCG 


Ser TCT 


Ala 007 


Ala GCA 


Ile ATC 


Ile ATT j 


Ser TCG 


Ser TCA 


Ala GCT 


Ala GCC 


Ile ATT 


Ile ATA 


Ser TCG 


Ser TCC 


Ala GCC 


Ala GCG 






Ser TCG 


Sei^ 00 


Ala^ 0 


Ala GCA 


Leu CTG 


Leu CTA j 


Ser TCG 


Ser-" 57 






Leu CTG 


Leu CTT 


Ser TCT 


Ser* 00 


Arg CGA 


Arg AGG 


Leu CTG 


Leu TTG 


Ser TCT 


Ser^ 7 


Arg CGA 


Arg CGG 


Leu CTG 


Leu 7 ™ 


Ser TCA 


Ser AGC 


Arg CGC 


Arg AGG 


Leu CTC 


Leu CTA 


Ser TCA 


Ser AGT 




Arg CGG 


Leu CTC 


Leu crr 


Ser TCC 


Ser AGC 




Arg AGG 


Leu CTC 


Leu TTG 


Ser TCC 


Ser^ 7 


Arg CGT 


Arg CGG 


Leu CTC 


Leu™ 






Arg AGA 


Arg AGG 


Leu CTA 


Leu TTG 


Thr ACG 


Thr ACC 


Arg AGA 


Arg CGG 


Leu CTA 


Leu TTA 


Thr ACG 


Thr ACA 






Leu crr 


Leu 770 


Thr ACG 


Thr ACT 


Asn^ 0 


Asn^ 7 


Leu crr 


Leu TTA 


Thr ACC 


Thr ACA 






Leu 71 " 0 


Leu™ 


Thr ACC 


Thr Acr 


Asp GAC 


Asp GAT 






Thr ACA 


Thr ACT 






Phe 777 


Phe 770 






Cys TGC 


Cys TGT 






Tyr TAC 


Tyr 7AT 






Pro ccc 


Pro CCT 
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First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Glu GAA 


Glu GAG 


_ ccc 

Pro ccc 


p ro CCA 


Val 10 


Val uli 






_ ccc 

Pro ccc 


_ ccn 


. Val ul ° 


x t 1 GTA 

Val u A 


. r.r,A 

Gly 


Gly GGC 


^ CCT 


„ CCA 


^ t 1 GTC 


Val u 


Criy 


r T iv GGT 


rro 


Pro CCG 


V <Xl 


Val GTA 


Gly GGA 


Gly GGG 






Val GTT 


Val GTA 















[0015] In other illustrative examples, the first and synonymous codons are selected 
from TABLE 6: 

TABLE 6 



First Codon 


Synonymous 
Codon 


rirst Codon 


Synonymous 
Codon 


rirst L/Ouon 


0 y nun yiiiuus 

Codon 


Ala 001 


Ala GCG 


Leu CTA 


Leu TTG 


Ser 7CG 


Ser TCT 


Ala GCT 


Ala GCA 


Leu CTA 


Leu TTA 


Ser 7CG 


Ser 7CA 


Ala 007 


Ala 000 


Leu 077 


Leu 770 


Ser 7CG 


Ser 7CC 






Leu CTT 


Leu T7A 


Ser TCG 


Ser AGC 


Arg CGA 


Arg AGG 


Leu 11 " 6 


Leu 77A 


Ser 7CG 


Ser AGT 


Arg CGA 


Arg CGG 






Ser 7CT 


Ser AGC 


Arg CGT 


Arg AGG 


Phe 777 


Phe 770 


Ser 7CT 


Ser AGT 


Arg CGT 


Arg CGG 






Ser 7CA 


Ser AGC 


Arg AGA 


Arg AGG 


Pro CCT 


p ro CCA 


Ser 7CA 


Ser^ 7 


Arg AGA 


Arg CGG 


Pro CCT 


Pro CCG 


Ser 7CC 


Ser AGC 














Glu GAA 


Glu GAG 






Thr ACG 


Thr ACC 










Thj .ACG 


Thr ACA 
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First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Gly 


Gly 






Thr 


Thr ACT 


Gly GGA 


Gly GGT 






A (~~* A 

Thr 


a r^T 

Thr 


piGGA 

Crly 


r GGG 

Oly 


















Val Grr 


Val GTA 















[0016] Suitably, in some of the illustrative examples noted above, the method 
further comprises selecting a second codon of the parent polynucleotide for replacement with 
a synonymous codon, wherein the synonymous codon is selected on the basis that it exhibits a 
lower immune response preference than the second codon in a comparison of immune 
response preferences; and; (b) replacing the second codon with the synonymous codon, 
wherein the comparison of immune response preferences of the codons is represented by 
TABLE 7: 



TABLE 7 



Second 
Codon 


Synonymous 
Codon 1 


Second 
Codon 


Synonymous 
Codon 


Second 
Codon 


Synonymous 
Codon 


Ala 001 " 


Ala 000 


Ile ATC 


Ile ATA 


Ser TCG 


Ser TCT 


Ala^ 1 " 


Ala 0 ^ 


Ile ATC 


Ile ATT 


Ser TCG 


Ser TCA 


Ala 001 


Ala 000 


Ile ATT 


Ile ATA 


Ser TCG 


Ser TCC 


Ala 000 


Ala 000 






Ser TCG 


Ser^ 


Ala 000 


Ala^ 


Leu CTG 


Leu CTA 


Ser TCG 


Ser AGT 






Leu CTG 


Leu CTT 


Ser TCT 


Ser AGC 


Arg CGA 


Arg AGG 


Leu CTG 


Leu TTG 


Ser TCT 


Ser AGT 


Arg CGA 


Arg CGG 


Leu CTG 


Leu TTA 


Ser TCA 


Ser AGC 


Arg CGC 


Arg AGG 


Leu CTC 


Leu CTA 


Ser TCA 


Ser AGT 


Arg CGC 


Arg CGG 


. Leu CTC 


Leu CTT 


Ser TCC 


Ser AGC 
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Second 
Codon 


Synonymous 
Codon 


Second 
Codon 


Synonymous 
Codon 


Second 
Codon 


Synonymous 
Codon 


CGT 

Arg CGT 


Ann 

Arg 


CTC 

Leu C7C 


ttp 

Leu 770 


TPG 

Ser 7CC 


AGT 

Ser AGT 


CGT 

Arg 


Arg 


PTP 

Leu olL 


_. XT A 

Leu" A 






AG A 

Arg AGA 


Arg AGO 


T PTA 

Leu clA 


T TTG 

Leu" 0 


APG 

Thr 


AGG 


AGA 

Arg AOA 


COG 

Arg CGG 


tt PTA 

Leu clA 


t TTA 

Leu 11A 


AGG 

Thr 


r-r^i AGA 

Thr 






_ PTT 

Leu L " 


_ TTG 

Leu" 0 


AGG 

Thr uu 


™, ACT 

Thr ACi 


A AC 


A AT 

Asn^ 7 


Leu CTT 


-w- TTA 

Leu 7 ™ 


AGG 

Thr 


rTM AGA 

Thr 






_ TTP 

Leu 770 


_ TTA 

Leu 71 A 


^ AGG 

Thr 


ACT 

Thr ACi 


HAP 

Asp GAC 


pat 

Asp GAT 






_.. AGA 

Thr 


r - r ^ 1 ACT 

Thr AL - ] 






_ TTT 

Phe 777 


TTP 

Phe 770 






Trip 

Cys TGC 


^- tpt 

Cys TGT 






_ TAC 

Tyr AC 


-T- TAT 

Tyr 






Pro ccc 


_ PPT 






/"I A A 

Glu 


Glu GAG 


ccc 


^ CCA 


^ r 1 GTG 


7 iGTT 

Val 






Pro ccc 


Pro cco 


x t i GTG 


«• T ,GTA 

Val 


Gly GGA 


/"in/""*' 

Gly 


r*PT 


„ CCA 

Pro OCA 


x t iGTC 

Val 


, 7 ,GTT 

Val 


nu.GGA 
Cjrly 


r T iv GGT 


Pro CCT 
rro 


Pro CCG 


Val GTC 

V <*1 


Val GTA 


Gly GGA 


Gly GGG 






Val 077 


Val GTA 















[0017] In another aspect, the invention provides a synthetic polynucleotide 
constructed according to any one of the above methods. 

[0018] In accordance with the present invention, synthetic polynucleotides that are 
constructed by methods described herein are useful for expression in a mammal to elicit an 
immune response to a target antigen. Accordingly, in yet another aspect, the present invention 
provides chimeric constructs that comprise a synthetic polynucleotide of the invention, which 
is operably connected to a regulatory polynucleotide. 
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[0019] In some embodiments, the chimeric construct is in the form of a 
pharmaceutical composition that optionally comprises a pharmaceutically acceptable 
excipient and/or carrier. Accordingly, in another aspect, the invention provides 
pharmaceutical compositions that are useful for modulating an immune response to a target 
5 antigen in a mammal, which response is conferred by the expression of a parent 

polynucleotide that encodes a polypeptide corresponding to at least a portion of the target 
antigen. These compositions generally comprise a chimeric construct and a pharmaceutically 
acceptable excipient and/or carrier, wherein the chimeric construct comprises a synthetic 
polynucleotide that is operably connected to a regulatory polynucleotide and that is 

1 0 distinguished from the parent polynucleotide by the replacement of a first codon in the parent 
polynucleotide with a synonymous codon that has a different immune response preference 
than the first codon and wherein the first and synonymous codons are selected according to 
any one of TABLES 2, 3, 5 and 6. In some embodiments, the compositions further comprise 
an adjuvant that enhances the effectiveness of the immune response. In some embodiments, 

15 the composition is formulated for transcutaneous or dermal administration, e.g., by biolistic or 
microneedle delivery or by intradermal injection. Suitably, in embodiments in which a 
stronger or enhanced immune response to the target antigen is desired, the first and 
synonymous codons are selected according to TABLES 2 or 3. Conversely, in embodiments 
in which a weaker or reduced immune response to the target antigen is desired, the first and 

20 synonymous codons are selected according to TABLES 5 or 6. 

[0020] In yet another aspect, the invention embraces methods of modulating the 
quality of an immune response to a target antigen in a mammal, which response is conferred 
by the expression of a parent polynucleotide that encodes a polypeptide corresponding to at 
least a portion of the target antigen. These methods generally comprise: introducing into the 

25 mammal a synthetic polynucleotide that is operably connected to a regulatory polynucleotide 
and that is distinguished from the parent polynucleotide by the replacement of a first codon in 
the parent polynucleotide with a synonymous codon that has a different immune response 
preference than the first codon and wherein the first and synonymous codons are selected 
according to any one of TABLES 2, 3, 5 and 6. In these methods, expression of the synthetic 

30 polynucleotide results in a different quality (e.g., stronger or weaker) of immune response 

than the one obtained through expression of the parent polynucleotide under the same 

conditions. Suitably, the chimeric construct is introduced into the mammal by delivering the 

construct to antigen-presenting cells (e.g., dendritic cells, macrophages, Langerhans cells or 

their precursors) of the mammal. In some embodiments, the chimeric construct is introduced 
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into the dermis and/or epidermis of the mammal (e.g., by transcutaneous or intradermal 
administration) and in this regard any suitable administration site is envisaged including the 
abdomen. Generally, the immune response is selected from a cell-mediated response and a 
humoral immune response. In some embodiments, the immune response is a humoral immune 
5 response. In other embodiments, the immune response is a cellular immune response. In still 
other embodiments, the immune response is a humoral immune response and a cellular 
immune response. 

[0021] In a related aspect, the invention encompasses methods of enhancing the 
quality of an immune response to a target antigen in a mammal, which response is conferred 

10 by the expression of a parent polynucleotide that encodes a polypeptide corresponding to at 
least a portion of the target antigen. These methods generally comprise: introducing into the 
mammal a chimeric construct comprising a synthetic polynucleotide that is operably 
connected to a regulatory polynucleotide and that is distinguished from the parent 
polynucleotide by the replacement of a first codon in the parent polynucleotide with a 

15 synonymous codon that has a higher immune response preference than the first codon, 

wherein the first and synonymous codons are selected according to TABLES 2 or 3. In these 
methods, expression of the synthetic polynucleotide typically results in a stronger or enhanced 
immune response than the one obtained through expression of the parent polynucleotide under 
the same conditions. 

20 [0022] In another related aspect, the invention extends to methods of reducing the 

quality of an immune response to a target antigen in a mammal, which response is conferred 
by the expression of a parent polynucleotide that encodes a polypeptide corresponding to at 
least a portion of the target antigen. These methods generally comprise: introducing into the 
mammal a chimeric construct comprising a synthetic polynucleotide that is operably 

25 connected to a regulatory polynucleotide and that is distinguished from the parent 

polynucleotide by the replacement of a first codon in the parent polynucleotide with a 
synonymous codon that has a lower immune response preference than the first codon, wherein 
the first and synonymous codons are selected according to TABLES 5 or 6. In these methods, 
expression of the synthetic polynucleotide typically results in a weaker or reduced immune 

30 response than the one obtained through expression of the parent polynucleotide under the 
same conditions. 

[0023] Yet a further aspect of the present invention embraces methods of enhancing 
the quality of an immune response to a target antigen in a mammal, which response is 
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conferred by the expression of a first polynucleotide that encodes a polypeptide corresponding 
to at least a portion of the target antigen. These methods generally comprise: co-introducing 
into the mammal a first nucleic acid construct comprising the first polynucleotide in operable 
connection with a regulatory polynucleotide; and a second nucleic acid construct comprising a 
second polynucleotide that is operably connected to a regulatory polynucleotide and that 
encodes an iso-tRNA corresponding to a codon of the first polynucleotide, wherein the codon 
has a low or intermediate immune response preference and is selected from the group 
consisting of Ala GCA , Ala GCG Ala GCC , Arg AGG Arg CGG Asn^, Asp GAT , Cys TGT , Glu GAG 
Gly GGG Gly GGT , Gly GGC , Ile ATA , Ile ATT , Leu TTG Leu TTA , Leu CTA , Leu CTT , Phe TTC , Pro CCA , 
Pro CCG Pro CCT , Ser AGG Ser^ 7 , Ser TCT , Ser TCA 5 Ser TCC , Thr^, Thr ACT , Tyr TAT , Val GTA and 
Val GTT . In specific embodiments, the codon has a 'low 5 immune response preference, and is 
selected from the group consisting of Ala GCA , Ala GCG Arg AGG Arg CGG Asn^, Asp GAT , 
Cys TGT , Glu GAG Gly GGG Gly GGT , Gly GGC , Ile ATA , Leu TTG Leu TTA , Phe TTG Pro CCA , Pro CCG 
Ser^, Ser AGT , Thr ACT , Tyr TAT and Val GTA 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[0024] Figure 1 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted ALA E7 constructs and controls (IgkCl 5 IgkSl-1, IgkSl-2, IgkSl-3, 
IgkSl-4 and IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated 
5 into the Kpnl and EcoKL sites of pCDNA3 . 

[0025] Figure 2 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted ARG E7 constructs and controls (IgkSl-5 ? IgkSl-6, IgkSl-7, IgkSl-8, 
IgkSl-9, IgkSl-10, IgkCl and IgkC2) as further defined in Example 1 and Table 12. The 
sequences are ligated into the Kpnl and EcoKL sites of pCDNA3. 

10 [0026] Figure 3 is a diagrammatic representation depicting a nucleotide sequence 

alignment of secreted ASN and LYS E7 constructs and controls (IgkCl, IgkSl-12, IgkSl-31 
and IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated into the 
Kpnl and EcoKL sites of pCDNA3. 

[0027] Figure 4 is a diagrammatic representation depicting a nucleotide sequence 
15 alignment of secreted ASP E7 constructs and controls (IgkCl, IgkSl-13 ? IgkSl-14 and 

IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated into the Kpnl 
and EcoKL sites of pCDNA3. 

[0028] Figure 5 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted CYS E7 constructs and controls (IgkCl, IgkSl~15, IgkSl-16 and 
20 IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated into the Kpnl 
and EcoKL sites of pCDNA3. 

[0029] Figure 6 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted GLU E7 constructs and controls (IgkSl-17, IgkSl-18 ? IgkC2 and 
IgkCl) as further defined in Example 1 and Table 12. The sequences are ligated into the Kpnl 
25 and EcoKl sites of pCDNA3. 

[0030] Figure 7 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted GLN E7 constructs and controls (IgkCl, IgkSl-19, IgkSl-20 and 
IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated into the Kpnl 
and EcoKL sites of pCDNA3. 

30 [0031] Figure 8 is a diagrammatic representation depicting a nucleotide sequence 

alignment of secreted GLY E7 constructs and controls (IgkCl, IgkSl-21, IgkSl-22, IgkSl-23, 
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IgkSl-24 and IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated 
into the Kpnl and EcoKL sites of pCDNA3. 

[0032] Figure 9 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted HIS E7 constructs and controls (IgkCl, IgkSl-25, IgkSl-26 and IgkC2) 
5 as further defined in Example 1 and Table 12. The sequences are ligated into the Kpnl and 
EcoKL sites of pCDNA3. 

[0033] Figure 10 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted ILE E7 constructs and controls (IgkCl, IgkSl-27, IgkSl~28, IgkSl-29 
and IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated into the 
1 0 Kpnl and EcoKL sites of pCDNA3 . 

[0034] Figure 1 1 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted LEU E7 constructs and controls (IgkSl-50 ? IgkSl-51, IgkSl-52, IgkSl- 
53, IgkSl-54, IgkSl-55, IgkC3 and IgkC4) as further defined in Example 1 and Table 12. The 
sequences are ligated into the Kpnl and EcoKL sites of pCDNA3. The LEU E7 constructs are 
1 5 oncogenic (/. e. , encode wild-type E7 protein). 

[0035] Figure 12 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted PHE E7 constructs and controls (IgkSl-32, IgkSl-33, IgkCland IgkC2) 
as further defined in Example 1 and Table 12. The sequences are ligated into the Kpnl and 
EcoKL sites of pCDNA3. Two LEU residues were mutated to PHE in this sequence so that 
20 there are three instead of one PHE residue. 

[0036] Figure 13 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted PRO E7 constructs and controls (IgkSl~56, IgkSl-57, IgkSl~58, IgkSl- 
59, IgkC3 and IgkC4) as further defined in Example 1 and Table 12. The sequences are 
ligated into the Kpnl and EcoKL sites of pCDNA3. The PRO E7 constructs are oncogenic (i.e., 
25 encode wild-type E7 protein). 

[0037] Figure 14 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted SER E7 constructs and controls (IgkSl-34, IgkSl-35, IgkSl-36, IgkSl- 
37 5 IgkSl-38 5 IgkSl-39, IgkCland IgkC2) as further defined in Example 1 and Table 12. The 
sequences are ligated into the Kpnl and EcoKL sites of pCDNA3. 

30 [0038] Figure 15 is a diagrammatic representation depicting a nucleotide sequence 

alignment of secreted THRE7 constructs and controls (IgkCl, IgkSl-40, IgkSl-41, IgkSl-42, 
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IgkSl-43 and IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated 
into the Kpnl and EcoRl sites of pCDNA3. 

[0039] Figure 16 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted TYR E7 constructs and controls (IgkCl, IgkSl~44, IgkSl-45 and 
5 IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated into the Kpnl 
and EcoKL sites of pCDNA3. 

[0040] Figure 17 is a diagrammatic representation depicting a nucleotide sequence 
alignment of secreted VAL E7 constructs and controls (IgkCl, IgkSl-46, IgkSl-47, IgkSl-48, 
IgkSl-49 and IgkC2) as further defined in Example 1 and Table 12. The sequences are ligated 
10 into the Kpnl and EcoKL sites of pCDNA3. 

[0041] Figure 18 is a graphical representation showing the response to gene gun 
immunization with optimized and de-optimized E7 constructs measured by (a) ELIS A, (b) 
Memory B cell ELISPOT, and (c) IFN-y ELISPOT. For part (a) eight mice were immunized 
per group (4 immunizations, 3 weeks apart) and the sera taken three weeks after the final 

15 immunization; (left) E7 protein ELISA, (right) E7 peptide 101 ELISA. Wells were done in 

duplicate. For parts (b) and (c) mice were immunized twice, three weeks apart and the spleens 
collected three weeks after the second immunization. The spleens were pooled prior to 
analysis. The Memory B cell and IFN-y ELISPOTs were conducted twice and three times, 
respectively, and the wells done in triplicate. Three mice were used per group per repeat. The 

20 results shown in parts (b) and (c) are from individual experiments and are representative of 
the complete data sets. The particular ELISPOT experimental data included here were 
gathered together with the corresponding data in Figure 20 and therefore may be directly 
compared. Unpaired two-tailed t-tests were used to compare the modified constructs to wild- 
type. *** PO.001, ** 0.001<P<0.01, * 0.01<P<0.05, ns= not significant (P>0.05). In (a) Ol- 

25 03 were not significantly different from MC as measured by unpaired two-tailed t-tests. wt= 
wild-type codon usage E7; 01-03= codon-optimized E7 constructs 1 to 3; W=codon de- 
optimized E7; MC= mammalian consensus codon usage E7. 

[0042] Figure 19 is a graphical representation showing the response to 
immunization by intradermal injection with optimized and de-optimized constructs measured 
30 by (a) ELISA, (b) Memory B cell ELISPOT, and (c) IFN-y ELISPOT. For part (a) eight mice 
were immunized per group (4 immunizations, 3 weeks apart) and the sera taken three weeks 
after the final immunization; (left) E7 protein ELISA, (right) E7 peptide 101 ELISA. Wells 
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were done in duplicate. For parts (b) and (c) mice were immunized twice, three weeks apart 
and the spleens collected three weeks after the second immunization. The spleens were pooled 
prior to analysis. The Memory B cell and IFN-y ELISPOTs were conducted twice and three 
times, respectively, and the wells done in triplicate. Three mice were used per group per 
5 repeat. The results shown in parts (b) and (c) are from individual experiments and are 

representative of the complete data sets. The particular ELISPOT experimental data included 
here were gathered together with the corresponding data in Figure 20 and therefore may be 
directly compared. Unpaired two-tailed t-tests were used to compare the modified constructs 
to wild-type. *** PO.001, ** 0.001<P<0.01, * 0.01<P<0.05, ns= not significant (P>0,05). In 
10 (a) Ol-OS were not significantly different from MC as measured by unpaired two-tailed t- 
tests. wt= wild-type codon usage E7; 01-03= codon-optimized E7 constructs 1 to 3; 
W=codon de-optimized E7; MC= mammalian consensus codon usage E7. 

[0043] Figure 20 is a graphical representation showing the results of an ELIS A that 
measures binding of serum from mice immunized with various gD2 constructs by intradermal 
15 injection (white bars) or gene gun immunization (black bars), to C -terminally His-tagged 
gD2tr. Note that the His-tagged gD2tr protein was used in an unpurified state (in CHO cell 
supernatant) and that background readings of non-specific binding to control supernatant have 
been subtracted from the results. 
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TABLE 8 

Brief Description of the Sequences 



SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 


cpn TO 7SJO- 1 


1 rr]s C 9_ 1 *3 A cr> f^r A T AADSfTJIAt Dl )A J Aof~i A^ CAA J J AA AA 
X^lVk3Z. 1 -J AV&jy \J A 1 vlrilotl livl IIUC-ICV/ CiCiV oV^Cj CivXICt/ 




Qpo TD NO* 9 


TcrV£k9-14 A SA OAO POTlStri lPt TillflprvH H A SPAllPnCP 


387 nts 

JO / 11 to 




Trrl#r^0 1 ^ Pvc TOT prmstmict miplpotiHp spaiiptipp 

IglVOZ,- 1 J L'JO 1 VJ 1 CtJlloU Ul/l UUVieVJllLlC oGLJUC<llV'C< 


387 nts 




1 f\ Ovs TfiP pnn <vtn i ct nuplpAtidp spniiPnpp 


387 nts 




ToTr^^-l 7 0111 OAO nnnQtnirt nnclpntiHp spnupncp 


387 nts 


cpn TD MO- 6 


T»lr^9- 1 R Oln OA A construct nucleotide seauence 


387 nts 


cpn td xrn- 7 


TctV^9-1Q Oln OAO ponstmpt nucleotide seauence 

IgJVVjZi 1 7 VJ 111 V^rVVJ V-'VJllO Ll 1 11U1V1V^\JL1U.V^ O^^UVIXVW 


387 nts 


orl/V^ IjL/ iNU. O 


frrlr^I9-'7n OJn OA A mn<itnirf nucleotide* seauence 

lUKuZr Z>\/ VJ 111 V_/A_r\ UUlldllUVl lltiwi^/V/llvl^' owvji^^/ix^w 


387 nts 


cun TD XIO- Q 


ToT^^I9 91 Olv OOO construct nucleotide seauence 

1 LlJvo 1 VJ 1 V Vj VJ VJ L/ WHO LI UlC'L 11 UV^l \s\J OVVJUVIXW 


387 nts 


qua td xto» i n 

oc,v<J ID iS\J. 1U 


TrrlrQ7 99 Olv OOA construct nucleotide seauence 


387 nts 


oi3v<J YD JNw. 1 1 


TitVQ9 9^ Olv OOT rAnQtnirt nuclpntiHp seauence 

lPK.^Zr~Z.-3 VJ1 V VJ VJ 1 Uwll^ll Llv^L iitiwit-Aj Ll\Jt/ ovv[uviiw 


387 nts 




TrTlrQ9 OA Olv OOO PAn<;triirt niiclpotirlp seauence 


387 nts 


CPA TH \Tf\- 1 "X 


TrrVQ9 97 Tip ATA pAn<itriirt niiclpotidp seauence 


387 nts 


CPA TPt \TA. 1 A 

oh/Q 1U JNU. 14 


TrrVQ9 9R Tip ATT pAn<5triirt rmcleotirlp seauence 


387 nts 


oiz,v>J ID rnKJ. 1 J 


TrrL-^9 90 Tip A TO rnn^trnrf nucleoftde ^eaUGTlCG 

lPjVO-<lr""^->' 11C AY 1 V^ L/WllollUV^l HUVIVUUU^ ovvjuviivv 


387 nts 


oby ID FS\J. ID 


TrrV<i9 id ^Ipr AOT rnn struct nucleotide seauence 

liiiN.LjZ J" Ovl AVJ 1 L/WllkJtl LlwL 11UV1VUUUV ov^wvuvv 


387 nts 


cr?A TD XTfV 1 1 
oliv^ ID InvJ. 1 / 


TrrVQ9 ^spr AOO pnn struct nucleotide seauence 


387 nts 


crro td XT Or- 1 ft 


TrrVQ9 <spr TOO construct nucleotide seauence 


387 nts 


CCA TD XTfV 1 Q 

ofcA^J id INC iy 


TrrVQ9 'Xi Qpr TO A construct nucleotide seauence 


387 nts 


SEO ID NO* 20 


IgkS2-38 Ser TCT construct nucleotide sequence 


387 nts 


SEQ ID NO: 21 


IgkS2-39 Ser TCC construct nucleotide sequence 


387 nts 


SEQ ID NO: 22 


IgkS2-40 Thr ACG construct nucleotide sequence 


387 nts 


SEQ ID NO: 23 


IgkS2-41 Thr AC A construct nucleotide sequence 


387 nts 


SEQ ID NO: 24 


IgkS2-42 Thr ACT construct nucleotide sequence 


387 nts 


SEQ ID NO: 25 


IgkS2-43 Thr ACC construct nucleotide sequence 


387 nts 
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SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 


SEQ ID NO: 26 


IgkS2-46 Val GTG construct nucleotide sequence 


387 nts 


SEQ ID NO: 27 


IgkS2-47 Val GTA construct nucleotide sequence 


387 nts 


SEQ ID NO: 28 


IgkS2-48 Val GTT construct nucleotide sequence 


387 nts 


SEQ ID NO: 29 


IgkS2-49 Val GTG construct nucleotide sequence 


387 nts 


SEQ ID NO: 30 


IgkS2-l Ala GCG Linker nucleotide sequence 


408 nts 


SEQ ID NO: 31 


IgkS2-2 Ala GCA Linker nucleotide sequence 


408 nts 


SEQ ID NO: 32 


IgkS2-3 Ala GCT Linker nucleotide sequence 


408 nts 


SEQ ID NO: 33 


IgkS2-4 Ala GCC Linker nucleotide sequence 


408 nts 


SEQ ID NO: 34 


IgkS2-5 Arg AGG Linker nucleotide sequence 


408 nts 


SEQ ID NO: 35 


IgkS2-6 Arg AGA Linker nucleotide sequence 


408 nts 


SEQ ID NO: 36 


IgkS2-7 Arg CGG Linker nucleotide sequence 


408 nts 


SEQ ID NO: 37 


IgkS2-8 Arg CGA Linker nucleotide sequence 


408 nts 


SEQ ID NO: 38 


IgkS2-9 Arg CGT Linker nucleotide sequence 


408 nts 


SEQ ID NO: 39 


IgkS2- 10 Arg CGC Linker nucleotide sequence 


408 nts 


SEQ ID NO: 40 


IgkS2-l 1 Asn AAT Linker nucleotide sequence 


408 nts 


SEQ ID NO: 41 


IgkS2-12 Asn AAC Linker nucleotide sequence 


408 nts 


SEQ ID NO: 42 


IgkS2-25 His CAT Linker nucleotide sequence 


408 nts 


SEQ ID NO: 43 


IgkS2-26 His CAC Linker nucleotide sequence 


408 nts 


SEQ ID NO: 44 


IgkS2-30 Lys AAG Linker nucleotide sequence 


408 nts 


SEQ ID NO: 45 


IgkS2-31 Lys AAA Linker nucleotide sequence 


408 nts 


SEQ ID NO: 46 


IgkS2-32 Phe TTT Linker nucleotide sequence 


408 nts 


SEQ ID NO: 47 


IgkS2-33 Phe TTC Linker nucleotide sequence 


408 nts 


SEQ ID NO: 48 


IgkS2-44 Tyr TAT Linker nucleotide sequence 


408 nts 


SEQ ID NO: 49 


IgkS2-45 Tyr TAC Linker nucleotide sequence 


408 nts 


SEQ ID NO: 50 


Influenza A Virus HA hemagglutinin (A/Hong 
Kong/21 3/03(H5Nl)) BAE07201 wild-type 


1707 nts 


SEQ ID NO: 51 


Influenza A Virus HA hemagglutinin (A/Hong 
Kong/2 13/03(H5N1)) BAE07201 wild-type 


568 aa 


SEQ ID NO: 52 


Influenza A Virus HA hemagglutinin (A/Hong 


1707 nts 
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SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 




Kong/213/03(H5N1)) Codon modified 




SEQ ID NO: 53 


Influenza A Virus HA hemagglutinin 
(A/swine/Korea/PZ72-l/2006 (H3N1)) DQ923506 
wild-type 


1701 nts 


SEQ ID NO: 54 


Influenza A Virus HA hemagglutinin 
(A/swine/Korea/PZ72- 1/2006 (H3N1)) DQ923506 
wild-type 


566 aa 


SEQ ID NO: 55 


Influenza A Virus HA hemagglutinin 
(A/swine/Korea/PZ72-l/2006 (H3N1)) Codon 
modified 


1701 nts | 


SEQ ID NO: 56 


Influenza A Virus NA neuraminidase (A/Hong 
Kong/2 1 3/03(H5N 1 )) AB2 12056 wild-type 


1410 nts 


SEQ ID NO: 57 


Influenza A Virus NA neuraminidase (A/Hong 
Kong/2 13/03(H5N1)) AB2 12056 wild-type 


469 aa 


SEQ ID NO: 58 


Influenza A Virus NA neuraminidase (A/Hong 
Kong/213/03(H5N1)) Codon modified 


1410 nts 


SEQ ID NO: 59 


Influenza A Virus NA neuraminidase 
(A/swine/MI/PU243/04 (H3N1)) DQ 150427 wild-type 


1410 nts 


SEQ ID NO: 60 


Influenza A Virus NA neuraminidase 
(A/swine/MI/PU243/04 (H3N1)) DQ 150427 wild-type 


469 aa 


SEQ ID NO: 61 


Influenza A Virus NA neuraminidase 
(A/swine/MI/PU243/04 (H3N1)) Codon modified 


1410 nts 


SEQ ID NO: 62 


Hepatitis C Virus El (Serotype 1A, isolate H77) 
AF009606 wild-type 


576 nts 


SEQ ID NO: 63 


Hepatitis C Virus El (Serotype 1A, isolate H77) NP 
751920 wild-type 


192 aa 


SEQ ID NO: 64 


Hepatitis C Virus El (Serotype 1 A, isolate H77) Codon 
modified 


576 nts 


SEQ ID NO: 65 


Hepatitis C Virus E2 (Serotype 1A, isolate H77) 
AF009606 wild-type 


1089 nts 


SEQ ID NO: 66 


Hepatitis C Virus E2 (Serotype 1A, isolate H77) NP 
751921 wild-type 


363 aa 


SEQ ID NO: 67 


Hepatitis C Virus E2 (Serotype 1A, isolate H77) Codon 
modified 


1089 nts 


SEQ ID NO: 68 


Epstein Barr Virus (Type 1, gp350 B95-8) NC 007605 
wild-type 


2724 nts 
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SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 


SEQ ID NO: 69 


Epstein Ban Virus (Type 1, gp350 B95-8) CAD53417 
wild-type 


907 aa 


SEQ ID NO: 70 


Epstein Barr Virus (Type 1, gp350 B95-8) Codon 
modified 


2724 nts 


SEQ ID NO: 71 


Epstein Barr Virus (Type 2, gp350 AG876) NC 009334 
wild-type 


2661 nts 


SEQ ID NO: 72 


Epstein Barr Virus (Type 2, gp350 AG876) YP 
001129462 wild-type 


886 aa 


SEQ ID NO: 73 


Epstein Barr Virus (Type 2, gp350 AG876) Codon 
Modified 


2661 nts 


SEQ ID NO: 74 


Herpes Simplex Virus 2 (Glycoprotein B strain HG52) 
NC 001798 wild-type 


2715 nts 


SEQ ID NO: 75 


Herpes Simplex Virus 2 (Glycoprotein B strain HG52) 
CAB06752 wild-type 


904 aa 


SEQ ID NO: 76 


Herpes Simplex Virus 2 (Glycoprotein B strain HG52) 
Codon modified 


2715 nts 


SEQ ID NO: 77 


Herpes Simplex Virus (Glycoprotein D strain HG52) 
NC 001798 wild-type 


1182 nts 


SEQ ID NO: 78 


Herpes Simplex Virus (Glycoprotein D strain HG52) 
NP 0044536 wild-type 


393 aa 


SEO ID NO - 79 


Herpes Simplex Virus (Glycoprotein D strain HG52) 
Codon modified 


1 182 nts 


SEQ ID NO: 80 


HPV-16E7 wild-type 


387 nts 


SEQ ID NO: 81 


HPV-16E7 01 


387 nts 


SEQ ID NO: 82 


HPV-16 E7 02 


387 nts 


SEQ ID NO: 83 


HPV-16E7 03 


417 nts 


SEQ ID NO: 84 


HPV-16 E7W 


387 nts 


SEQ ID NO: 85 


HSV-2 gD2 wild-type 


11 82 nts 


SEQ ID NO: 86 


HSV-2 gD2 Ol 


1182 nts 


SEQ ID NO: 87 


HSV-2 gD2 02 


11 82 nts 


SEQ ID NO: 88 


HSV-2 gD2 03 


11 82 nts 


SEQ ID NO: 89 


HSV-2 gD2 W 


11 82 nts 
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SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 


SEQ ID NO: 90 
SEQ ID NO: 91 


Common forward primer 
ODN-7909 


41 nts 
24 nts 
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DETAILED DESCRIPTION OF THE INVENTION 

1. Definitions 

[0044] Unless defined otherwise, all technical and scientific terms used herein have 
the same meaning as commonly understood by those of ordinary skill in the art to which the 
5 invention belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, preferred 
methods and materials are described. For the purposes of the present invention, the following 
terms are defined below. 

[0045] The articles "a" and "an" are used herein to refer to one or to more than one 
10 (i.e. to at least one) of the grammatical object of the article. By way of example, "an element" 
means one element or more than one element. 

[0046] By "about" is meant a quantity, level, value, frequency, percentage, 
dimension, size, or amount that varies by no more than 15%, and preferably by no more than 
10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% to a reference quantity, level, value, frequency, 
15 percentage, dimension, size, or amount. 

[0047] The terms "administration concurrently" or "administering concurrently" or 
"co-administering" and the like refer to the administration of a single composition containing 
two or more actives, or the administration of each active as separate compositions and/or 
delivered by separate routes either contemporaneously or simultaneously or sequentially 

20 within a short enough period of time that the effective result is equivalent to that obtained 

when all such actives are administered as a single composition. By "simultaneously" is meant 
that the active agents are administered at substantially the same time, and desirably together in 
the same formulation. By "contemporaneously" it is meant that the active agents are 
administered closely in time, e.g., one agent is administered within from about one minute to 

25 within about one day before or after another. Any contemporaneous time is useful. However, 
it will often be the case that when not administered simultaneously, the agents will be 
administered within about one minute to within about eight hours and preferably within less 
than about one to about four hours. When administered contemporaneously, the agents are 
suitably administered at the same site on the subject. The term "same site" includes the exact 

30 location, but can be within about 0.5 to about 15 centimeters, preferably from within about 
0.5 to about 5 centimeters. The term "separately" as used herein means that the agents are 
administered at an interval, for example at an interval of about a day to several weeks or 
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months. The active agents may be administered in either order. The term "sequentially" as 
used herein means that the agents are administered in sequence, for example at an interval or 
intervals of minutes, hours, days or weeks. If appropriate the active agents may be 
administered in a regular repeating cycle. 

5 [0048] As used herein, the term "exacting sequence" or "czs-regulatory region" or 

similar term shall be taken to mean any sequence of nucleotides which is derived from an 
expressible genetic sequence wherein the expression of the genetic sequence is regulated, at 
least in part, by the sequence of nucleotides. Those skilled in the art will be aware that a cis- 
regulatory region may be capable of activating, silencing, enhancing, repressing or otherwise 
10 altering the level of expression and/or cell-type-specificity and/or developmental specificity 
of any structural gene sequence. 

[0049] Throughout this specification, unless the context requires otherwise, the 
words "comprise," "comprises" and "comprising" will be understood to imply the inclusion 
of a stated step or element or group of steps or elements but not the exclusion of any other 
15 step or element or group of steps or elements. 

[0050] As used herein, a "chimeric construct" refers to a polynucleotide having 
heterologous nucleic acid elements. Chimeric constructs include "expression cassettes" or 
"expression constructs," which refer to an assembly that is capable of directing the expression 
of the sequence(s) or gene(s) of interest. An expression cassette generally includes control 

20 elements such as a promoter that is operably linked to (so as to direct transcription of) a 

synthetic polynucleotide of the invention, and often includes a polyadenylation sequence as 
well. Within certain embodiments of the invention, the chimeric construct may be contained 
within a vector. In addition to the components of the chimeric construct, the vector may 
include, one or more selectable markers, a signal which allows the vector to exist as single- 

25 stranded DNA (e.g., a Ml 3 origin of replication), at least one multiple cloning site, and a 
"mammalian" origin of replication (e.g., a SV40 or adenovirus origin of replication). 

[0051] As used herein, "conferred immune response," "immune response that is 

conferred" and the like refer to a temporary or permanent change in immune response to a 

target antigen, which occurs or would occur after the introduction of a polynucleotide to the 

30 mammal, and which would not occur in the absence of that introduction. Typically, such a 

temporary or permanent change occurs as a result of the transcription and/or translation of 

genetic information contained within that polynucleotide in a cell, or in at least one cell or cell 

type or class of cell within a mammal or within a class of mammals, and can be used to 
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distinguish the mammal, or class of mammals to which the polynucleotide has been provided 
from a similar mammal, or class of mammals, to which the polynucleotide has not been 
provided. 

[0052] By "corresponds to" or "corresponding to" is meant an antigen which 
5 encodes an amino acid sequence that displays substantial similarity to an amino acid sequence 
in a target antigen. In general the antigen will display at least about 30, 40, 50, 55, 60, 65, 70, 
75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 % similarity or identity to at least a portion of 
the target antigen {e.g., at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or 95% of 
the amino acid sequence of the target antigen). 

1 0 [0053] By "effective amount, " in the context of modulating an immune response or 

treating or preventing a disease or condition, is meant the administration of that amount of 
composition to an individual in need thereof, either in a single dose or as part of a series, that 
is effective for achieving that modulation, treatment or prevention. The effective amount will 
vary depending upon the health and physical condition of the individual to be treated, the 

15 taxonomic group of individual to be treated, the formulation of the composition, the 

assessment of the medical situation, and other relevant factors. It is expected that the amount 
will fall in a relatively broad range that can be determined through routine trials. 

[0054] The terms "enhancing an immune response," "producing a stronger immune 
response" and the like refer to increasing an animal' s capacity to respond to a target antigen 

20 (e.g. , a foreign or disease-specific antigen or a self antigen), which can be determined for 

example by detecting an increase in the number, activity, and ability of the animal's cells that 
are primed to attack such antigens or an increase in the titer or activity of antibodies in the 
animal, which are immuno-interactive with the target antigen. Strength of immune response 
can be measured by standard immunoassays including: direct measurement of antibody titers 

25 or peripheral blood lymphocytes; cytolytic T lymphocyte assays; assays of natural killer cell 
cytotoxicity; cell proliferation assays including lymphoproliferation (lymphocyte activation) 
assays; immunoassays of immune cell subsets; assays of T-lymphocytes specific for the 
antigen in a sensitized subject; skin tests for cell-mediated immunity; etc. Such assays are 
well known in the art. See, e.g., Erickson et al 9 1993, J. Immunol. 151 :4189-4199; Doe et al 9 

30 1994, Eur. J. Immunol. 24:2369-2376. Recent methods of measuring cell-mediated immune 
response include measurement of intracellular cytokines or cytokine secretion by T-cell 
populations, or by measurement of epitope specific T-cells (e.g., by the tetramer technique) 
(reviewed by McMichael, A. J., and O'Callaghan, C. A., 1998, J. Exp. Med. 187(9)1367- 
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1371; Mcheyzer- Williams, M. G., etal, 1996, Immunol. Rev. 150:5-21; Lalvani, A., etal, 
1997, J. Exp. Med. 186:859-865). Any statistically significant increase in strength of immune 
response as measured for example by immunoassay is considered an "enhanced immune 
response" or "immunoehhancement" as used herein. Enhanced immune response is also 
5 indicated by physical manifestations such as fever and inflammation, as well as healing of 
systemic and local infections, and reduction of symptoms in disease, i.e., decrease in tumor 
size, alleviation of symptoms of a disease or condition including, but not restricted to, leprosy, 
tuberculosis, malaria, naphthous ulcers, herpetic and papillomatous warts, gingivitis, 
arthrosclerosis, the concomitants of AIDS such as Kaposi's sarcoma, bronchial infections, and 
10 the like. Such physical manifestations also encompass "enhanced immune response" or 
"immunoenhancement" as used herein. By contrast, "reducing an immune response," 
"producing a weaker immune response" and the like refer to decreasing an animaPs capacity 
to respond to a target antigen, which can be determined for example by conducting 
immunoassays or assessing physical manifestations, as described for example above. 

15 [0055] The terms "expression" or "gene expression" refer to production of RNA 

message and/or translation of RNA message into proteins or polypeptides. 

[0056] By "expression vector" is meant any autonomous genetic element capable 
of directing the synthesis of a protein encoded by the vector. Such expression vectors are 
known by practitioners in the art. 

20 [0057] The term "gene" is used in its broadest context to include both a genomic 

DNA region corresponding to the gene as well as a cDNA sequence corresponding to exons 
or a recombinant molecule engineered to encode a functional form of a product. 

[0058] As used herein the term "heterologous" refers to a combination of elements 
that are not naturally occurring or that are obtained from different sources. 

25 [0059] "Immune response" or "immunological response" refers to the concerted 

action of lymphocytes, antigen-presenting cells, phagocytic cells, granulocytes, and soluble 
macromolecules produced by the above cells or the liver (including antibodies, cytokines, and 
complement) that results in selective damage to, destruction of, or elimination from the body 
of cancerous cells, metastatic tumor cells, metastatic breast cancer cells, invading pathogens, 

30 cells or tissues infected with pathogens, or, in cases of autoimmunity or pathological 

inflammation, normal human cells or tissues. In some embodiments, an "immune response' 
encompasses the development in an individual of a humoral and/or a cellular immune 
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response to a polypeptide that is encoded by an introduced synthetic polynucleotide of the 
invention. As known in the art, the terms "humoral immune response" includes and 
encompasses an immune response mediated by antibody molecules, while a "cellular immune 
response" includes and encompasses an immune response mediated by T-lymphocytes and/or 
5 other white blood cells. Thus, an immune response that is stimulated by a synthetic 

polynucleotide of the invention may be one that stimulates the production of antibodies (e.g., 
neutralizing antibodies that block bacterial toxins and pathogens such as viruses entering cells 
and replicating by binding to toxins and pathogens, typically protecting cells from infection 
and destruction). The synthetic polynucleotide may also elicit production of cytolytic T 

10 lymphocytes (CTLs). Hence, an immunological response may include one or more of the 

following effects: the production of antibodies by B-cells; and/or the activation of suppressor 
T-cells and/or memory/effector T-cells directed specifically to an antigen or antigens present 
in the composition or vaccine of interest. In some embodiments, these responses may serve to 
neutralize infectivity, and/or mediate antibody-complement, or antibody dependent cell 

15 cytotoxicity (ADCC) to provide protection to an immunized host. Such responses can be 
determined using standard immunoassays and neutralization assays, well known in the art. 
(See, e.g., Montefiori et al, 1988, J Clin Microbiol. 26:231-235; Dreyer et al, 1999, AIDS 
Res Hum Retroviruses 15(1 7) :15 63 -1571). The innate immune system of mammals also 
recognizes and responds to molecular features of pathogenic organisms and cancer cells via 

20 activation of Toll-like receptors and similar receptor molecules on immune cells. Upon 
activation of the innate immune system, various non-adaptive immune response cells are 
activated to, e.g., produce various cytokines, lymphokines and chemokines. Cells activated by 
an innate immune response include immature and mature dendritic cells of, for example, the 
monocyte and plamsacytoid lineage (MDC, PDC), as well as gamma, delta, alpha and beta T 

25 cells and B cells and the like. Thus, the present invention also contemplates an immune 
response wherein the immune response involves both an innate and adaptive response. 

[0060] A composition is "immunogenic" if it is capable of either: a) generating an 
immune response against a target antigen (e.g., a viral or tumor antigen) in an individual; or b) 
reconstituting, boosting, or maintaining an immune response in an individual beyond what 
30 would occur if the agent or composition was not administered. An agent or composition is 

immunogenic if it is capable of attaining either of these criteria when administered in single or 
multiple doses. 
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[0061] "Immunomodulation," modulating an immune response" and the like refer 
to the modulation of the immune system in response to a stimulus and includes increasing or 
decreasing an immune response to a target antigen or changing an immune response from one 
that is predominantly a humoral immune response to one that is a more cell-mediated immune 
5 response and vice versa. For example, it is known in the art that decreasing the amount of 
antigen for immunization can change the bias of the immune system from a predominantly 
humoral immune response to a predominantly cellular immune response. 

[0062] By "isoaccepting transfer RNA" or "iso-tRNA" is meant one or more 
transfer RNA molecules that differ in their anticodon nucleotide sequence but are specific for 
10 the same amino acid. 

[0063] As used herein, the term "mammal" refers to any mammal including, 
without limitation, humans and other primates, including non-human primates such as 
chimpanzees and other apes and monkey species; farm animals such as cattle, sheep, pigs, 
goats and horses; domestic mammals such as dogs and cats; and laboratory animals including 
15 rodents such as mice, rats and guinea pigs. The term does not denote a particular age. Thus, 
both adult and newborn individuals are intended to be covered. 

[0064] By "modulating," "modulate" and the like is meant increasing or 
decreasing, either directly or indirectly, the quality of a selected phenotype (e.g., an immune 
response). In certain embodiments, "modulation" or "modulating" means that a 

20 desired/selected immune response is more efficient (e.g., at least 10%, 20%, 30%o, 40%, 50%, 
60% or more), more rapid (e.g., at least 10%, 20%, 30%, 40%, 50%, 60% or more), greater in 
magnitude (e.g., at least 10%, 20%, 30%, 40%, 50%, 60% or more), and/or more easily 
induced (e.g., at least 10%, 20%, 30%, 40%, 50%, 60% or more) than if the parent 
polynucleotide had been used under the same conditions as the synthetic polynucleotide. In 

25 other embodiments, "modulation" or "modulating" means changing an immune response 
from a predominantly antibody-mediated immune response as conferred by the parent 
polynucleotide, to a predominantly cellular immune response as conferred by the synthetic 
polynucleotide under the same conditions. In still other embodiments, "modulation" or 
"modulating" means changing an immune response from a predominantly cellular immune 

30 response as conferred by the parent polynucleotide, to a predominantly antibody-mediated 
immune response as conferred by the synthetic polynucleotide under the same conditions. 
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[0065] By "natural gene" is meant a gene that naturally encodes the protein. 
However, it is possible that the parent polynucleotide encodes a protein that is not naturally- 
occurring but has been engineered using recombinant techniques. 

[0066] The term "5 5 non-coding region" is used herein in its broadest context to 
5 include all nucleotide sequences which are derived from the upstream region of an expressible 
gene, other than those sequences which encode amino acid residues which comprise the 
polypeptide product of the gene, wherein 5' non-coding region confers or activates or 
otherwise facilitates, at least in part, expression of the gene. 

[0067] The term "oligonucleotide" as used herein refers to a polymer composed of 
10 a multiplicity of nucleotide units (deoxyribonucleotides or ribonucleotides, or related 

structural variants or synthetic analogues thereof) linked via phosphodiester bonds (or related 
structural variants or synthetic analogues thereof). Thus, while the term "oligonucleotide" 
typically refers to a nucleotide polymer in which the nucleotides and linkages between them 
are naturally occurring, it will be understood that the term also includes within its scope 
15 various analogues including, but not restricted to, peptide nucleic acids (PNAs), 

phosphoramidates, phosphorothioates, methyl phosphonates, 2-O-methyl ribonucleic acids, 
and the like. The exact size of the molecule may vary depending on the particular application. 
An oligonucleotide is typically rather short in length, generally from about 10 to 30 
nucleotides, but the term can refer to molecules of any length, although the term 
20 "polynucleotide" or "nucleic acid" is typically used for large oligonucleotides. 

[0068] The terms "operably connected," "operably linked" and the like as used 
herein refer to an arrangement of elements wherein the components so described are 
configured so as to perform their usual function. Thus, a given promoter operably linked to a 
coding sequence is capable of effecting the expression of the coding sequence when the 

25 proper enzymes are present. The promoter need not be contiguous with the coding sequence, 
so long as it functions to direct the expression thereof. Thus, for example, intervening 
untranslated yet transcribed sequences can be present between the promoter sequence and the 
coding sequence and the promoter sequence can still be considered "operably linked" to the 
coding sequence. Terms such as "operably connected," therefore, include placing a structural 

30 gene under the regulatory control of a promoter, which then controls the transcription and 
optionally translation of the gene. In the construction of heterologous promoter/structural 
gene combinations, it is generally preferred to position the genetic sequence or promoter at a 
distance from the gene transcription start site that is approximately the same as the distance 
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between that genetic sequence or promoter and the gene it controls in its natural setting; i.e. 
the gene from which the genetic sequence or promoter is derived. As is known in the art, 
some variation in this distance can be accommodated without loss of function. Similarly, the 
preferred positioning of a regulatory sequence element with respect to a heterologous gene to 
5 be placed under its control is defined by the positioning of the element in its natural setting; 
i.e., the genes from which it is derived. 

[0069] By "pharmaceutically-acceptable carrier" is meant a solid or liquid filler, 
diluent or encapsulating substance that may be safely used in topical or systemic 
administration. 

10 [0070] The term "phenotype" means any one or more detectable physical or 

functional characteristics, properties, attributes or traits of an organism, tissue, or cell, or class 
of organisms, tissues or cells, which generally result from the interaction between the genetic 
makeup (i.e., genotype) of the organism, tissue, or cell, or the class of organisms, tissues or 
cells and the environment. 

1 5 [0071] By "phenotypic preference" is meant the preference with which an organism 

uses a codon to produce a selected phenotype. This preference can be evidenced, for example, 
by the quality of a selected phenotype that is producible by a polynucleotide that comprises 
the codon in an open reading frame which codes for a polypeptide that produces the selected 
phenotype. In certain embodiment, the preference of usage is independent of the route by 

20 which the polynucleotide is introduced into the organism. However, in other embodiments, 
the preference of usage is dependent on the route of introduction of the polynucleotide into 
the organism. 

[0072] The term "polynucleotide" or "nucleic acid" as used herein designates 
mRNA, RNA, cRNA, cDNA or DNA. The term typically refers to oligonucleotides greater 
25 than 30 nucleotides in length. 

[0073] "Polypeptide," "peptide" and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues and to variants and synthetic analogues of the same. 
Thus, these terms apply to amino acid polymers in which one or more amino acid residues is a 
synthetic non-naturally occurring amino acid, such as a chemical analogue of a corresponding 
30 naturally occurring amino acid, as well as to naturally-occurring amino acid polymers. As 
used herein, the terms "polypeptide," "peptide" and "protein" are not limited to a minimum 
length of the product. Thus, peptides, oligopeptides, dimers, multimers, and the like, are 
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included within the definition. Both full-length proteins and fragments thereof are 
encompassed by the definition. The terms also include post expression modifications of a 
polypeptide, for example, glycosylation, acetylation, phosphorylation and the like. In some 
embodiments, a "polypeptide" refers to a protein which includes modifications, such as 
5 deletions, additions and substitutions (generally conservative in nature), to the native 

sequence, so long as the protein maintains the desired activity. These modifications may be 
deliberate, as through site-directed mutagenesis, or may be accidental, such as through 
mutations of hosts which produce the proteins or errors due to PCR amplification. 

[0074] The terms "polypeptide variant," and "variant" refer to polypeptides that 
10 vary from a reference polypeptide by the addition, deletion or substitution (generally 

conservative in nature) of at least one amino acid residue. Typically, variants retain a desired 
activity of the reference polypeptide, such as antigenic activity in inducing an immune 
response against a target antigen. In general, variant polypeptides are "substantially similar" 
or substantially identical" to the reference polypeptide, e.g., amino acid sequence identity or 
15 similarity of more than 50%, generally more than 60%-70%, even more particularly 80%- 

85% or more, such as at least 90%-95% or more, when the two sequences are aligned. Often, 
the variants will include the same number of amino acids but will include substitutions, as 
explained herein. 

[0075] By "primer" is meant an oligonucleotide which, when paired with a strand 
20 of DNA, is capable of initiating the synthesis of a primer extension product in the presence of 
a suitable polymerizing agent. The primer is preferably single-stranded for maximum 
efficiency in amplification but may alternatively be double-stranded. A primer must be 
sufficiently long to prime the synthesis of extension products in the presence of the 
polymerization agent. The length of the primer depends on many factors, including 
25 application, temperature to be employed, template reaction conditions, other reagents, and 
source of primers. For example, depending on the complexity of the target sequence, the 
oligonucleotide primer typically contains 15 to 35 or more nucleotides, although it may 
contain fewer nucleotides. Primers can be large polynucleotides, such as from about 200 
nucleotides to several kilobases or more. Primers may be selected to be "substantially 
30 complementary" to the sequence on the template to which it is designed to hybridize and 
serve as a site for the initiation of synthesis. By "substantially complementary", it is meant 
that the primer is sufficiently complementary to hybridize with a target nucleotide sequence. 
Preferably, the primer contains no mismatches with the template to which it is designed to 
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hybridize but this is not essential For example, non-complementary nucleotides may be 
attached to the 5' end of the primer, with the remainder of the primer sequence being 
complementary to the template. Alternatively, non-complementary nucleotides or a stretch of 
non-complementary nucleotides can be interspersed into a primer, provided that the primer 
5 sequence has sufficient complementarity with the sequence of the template to hybridize 
therewith and thereby form a template for synthesis of the extension product of the primer. 

[0076] Reference herein to a "promoter" is to be taken in its broadest context and 
includes the transcriptional regulatory sequences of a classical genomic gene, including the 
TATA box which is required for accurate transcription initiation, with or without a CCAAT 

10 box sequence and additional regulatory elements (i.e. upstream activating sequences, 

enhancers and silencers) which alter gene expression in response to developmental and/or 
environmental stimuli, or in a tissue-specific or cell-type-specific manner. A promoter is 
usually, but not necessarily, positioned upstream or 5\ of a structural gene, the expression of 
which it regulates. Furthermore, the regulatory elements comprising a promoter are usually 

15 positioned within 2 kb of the start site of transcription of the gene. Preferred promoters 

according to the invention may contain additional copies of one or more specific regulatory 
elements to further enhance expression in a cell, and/or to alter the timing of expression of a 
structural gene to which it is operably connected. 

[0077] The term "quality" is used herein in its broadest sense and includes a 
20 measure, strength, intensity, degree or grade of a phenotype, e.g. , a superior or inferior 
immune response. 

[0078] The term "sequence identity" as used herein refers to the extent that 
sequences are identical on a nucleotide-by-nucleotide basis or an amino acid-by-amino acid 
basis over a window of comparison. Thus, a "percentage of sequence identity" is calculated 

25 by comparing two optimally aligned sequences over the window of comparison, determining 
the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, I) or the 
identical amino acid residue (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, He, Phe, Tyr, Trp, Lys, 
Arg, His, Asp, Glu, Asn, Gin, Cys and Met) occurs in both sequences to yield the number of 
matched positions, dividing the number of matched positions by the total number of positions 

30 in the window of comparison (i.e., the window size), and multiplying the result by 100 to 
yield the percentage of sequence identity. For the purposes of the present invention, 
"sequence identity" will be understood to mean the "match percentage" calculated by the 
DNASIS computer program (Version 2.5 for windows; available from Hitachi Software 
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engineering Co., Ltd., South San Francisco, California, USA) using standard defaults as used 
in the reference manual accompanying the software. 

[0079] "Similarity" refers to the percentage number of amino acids that are 
identical or constitute conservative substitutions as defined in Table 10. Similarity may be 
5 determined using sequence comparison programs such as GAP (Deveraux et al. 1984, Nucleic 
Acids Research 12, 387-395). In this way, sequences of a similar or substantially different 
length to those cited herein might be compared by insertion of gaps into the alignment, such 
gaps being determined, for example, by the comparison algorithm used by GAP. 

[0080] Terms used to describe sequence relationships between two or more 

10 polynucleotides or polypeptides include "reference sequence", "comparison window", 
"sequence identity", "percentage of sequence identity" and "substantial identity". A 
"reference sequence" is at least 12 but frequently 15 to 18 and often at least 25 monomer 
units, inclusive of nucleotides and amino acid residues, in length. Because two 
polynucleotides may each comprise (1) a sequence (i.e., only a portion of the complete 

15 polynucleotide sequence) that is similar between the two polynucleotides, and (2) a sequence 
that is divergent between the two polynucleotides, sequence comparisons between two (or 
more) polynucleotides are typically performed by comparing sequences of the two 
polynucleotides over a "comparison window" to identify and compare local regions of 
sequence similarity. A "comparison window" refers to a conceptual segment of at least 6 

20 contiguous positions, usually about 50 to about 100, more usually about 100 to about 150 in 
which a sequence is compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. The comparison window may 
comprise additions or deletions (i.e., gaps) of about 20% or less as compared to the reference 
sequence (which does not comprise additions or deletions) for optimal alignment of the two 

25 sequences. Optimal alignment of sequences for aligning a comparison window may be 

conducted by computerized implementations of algorithms (GAP, BESTFIT, FASTA, and 
TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer 
Group, 575 Science Drive Madison, WI, USA) or by inspection and the best alignment (i.e., 
resulting in the highest percentage homology over the comparison window) generated by any 

30 of the various methods selected. Reference also may be made to the BLAST family of 
programs as for example disclosed by Altschul et al, 1997, Nucl. Acids Res. 25:3389. A 
detailed discussion of sequence analysis can be found in Unit 19.3 of Ausubel et al, "Current 
Protocols in Molecular Biology", John Wiley & Sons Inc, 1994-1998, Chapter 15. 
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[0081] The term "synthetic polynucleotide" as used herein refers to a 
polynucleotide that is formed by recombinant or synthetic techniques and typically includes 
polynucleotides that are not normally found in nature. 

[0082] The term "synonymous codon" as used herein refers to a codon having a 
different nucleotide sequence than another codon but encoding the same amino acid as that 
other codon. 

[0083] By "treatment," "treat," "treated" and the like is meant to include both 
therapeutic and prophylactic treatment. 

[0084] By "vector" is meant a nucleic acid molecule, preferably a DNA molecule 
derived, for example, from a plasmid, bacteriophage, or plant virus, into which a nucleic acid 
sequence may be inserted or cloned. A vector preferably contains one or more unique 
restriction sites and may be capable of autonomous replication in a defined host cell including 
a target cell or tissue or a progenitor cell or tissue thereof, or be integrable with the genome of 
the defined host such that the cloned sequence is reproducible. Accordingly, the vector may 
be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, 
the replication of which is independent of chromosomal replication, e.g., a linear or closed 
circular plasmid, an extrachromosomal element, a minichromosome, or an artificial 
chromosome. The vector may contain any means for assuring self-replication. Alternatively, 
the vector may be one which, when introduced into the host cell, is integrated into the genome 
and replicated together with the chromosome(s) into which it has been integrated. A vector 
system may comprise a single vector or plasmid, two or more vectors or plasmids, which 
together contain the total DNA to be introduced into the genome of the host cell, or a 
transposon. The choice of the vector will typically depend on the compatibility of the vector 
with the host cell into which the vector is to be introduced. The vector may also include a 
selection marker such as an antibiotic resistance gene that can be used for selection of suitable 
transformants. Examples of such resistance genes are well known to those of skill in the art. 
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2. Abbreviations 

[0085] The following abbreviations are used throughout the application: 

nt = nucleotide 
nts = nucleotides 
aa = amino acid(s) 
kb = kilobase(s) or kilobase pair(s) 
kDa = kilodalton(s) 
d== day 
h = hour 
s = seconds 



3. Immune response preference ranking of codons in mammals 
5 [0086] The present invention provides for the first time an immune response 

preference ranking of individual synonymous codons in mammals. This ranking was 
determined using a construct system that comprises a series of reporter constructs each 
comprising a different coding sequence for an antigenic polypeptide (e.g., a papillomavirus 
E7 polypeptide), wherein the coding sequence of individual constructs is distinguished from a 

10 parent coding sequence that encodes the antigenic polypeptide by the substitution of a single 
species of iso-accepting codon for each other species of iso-accepting codon that is present in 
the parent coding sequence. Accordingly, the coding sequence of individual synthetic 
constructs uses the same iso-accepting codon to encode most instances and preferably every 
instance of a particular amino acid residue (e.g., Ala GCT for all alanines) in the antigenic 

15 polypeptide and individual synthetic constructs differ in the species of iso-accepting codon 
used to encode a particular amino acid residue across the polypeptide sequence. As used 
herein, the species of iso-accepting codon that is used to encode a particular amino acid 
residue in the antigenic polypeptide is referred to as a "standardized codon". An illustrative 
synthetic construct system is described in Example 1, which covers the entire set of 

20 synonymous codons that code for amino acids. 

[0087] Test mammals (e.g. 9 mice) were immunized with the synthetic construct 
system in which individual mammals were immunized with a different synthetic construct and 
the host immune response (e.g., a humoral immune response or a cellular immune response) 
to the antigenic polypeptide was determined for each construct. In accordance with the 
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present invention, the strength of immune response obtained from individual synthetic 
constructs provides a direct correlation to the immune preference of a corresponding 
standardized codon in a test mammal. Accordingly, the stronger the immune response 
produced from a given construct in a test mammal, the higher the immune preference will be 
5 of the corresponding standardized codon. 

[0088] Comparison of the immune response preferences so determined with the 
translational efficiencies derived from codon usage frequency values for mammalian cells in 
general as determined by Seed (see U.S. Patent Serial Nos 5,786,464 and 5,795,737) reveals 
several differences in the ranking of codons. For convenience, these differences are 
10 highlighted in TABLE 9, in which Seed 'preferred' codons are highlighted with a blue 

background, Seed 'less preferred' codons are highlighted with a green background, and Seed 
6 non preferred' codons are highlighted with a grey background. 



TABLE 9 



aa 


by Seed for mammalian cells in 
general 


Rxnerimentallv determined codon 
immune response preferences in test 

IllcUIlIIlal b 


Ala 


GCC » (GCG, GCT, GCA) 


GCT > GCC > (GCA GCG) | 


Arg 


CGC » (CGA, CGT, AGA, AGG, 
CGG) 


(CGA, CGC, CGT, AGA) > (AGG, 
CGG) 


Asn 


AAC » AAT 


AAC > AAT 


Asp 


GAC » GAT 


GAC > GAT 


Cys 


TGC » TGT 


TGC > TGT 


Glu 


(GAA, GAG) 


GAA > GAG 


Gin I 


CAG » CAA 


CAA = CAG 


Gly 


GGC > GGG > (GGT, GGA) 


GGA > (GGG, GGT, GGC) 


His 


CAC » CAT 


CAC = CAT 


He 


ATC > ATT > ATA 


ATC » ATT > ATA 


Leu 


CTG > CTC > (TTA,CTA, CTT, 
TTG) 


(CTG, CTC) > (CTA, CTT) » TTG > 
TTA 


Lys 


AAG » AAA 


AAG = AAA 
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jrreicreniiai couon Uodgc <xi> prcu.iL/icci 
by Seed for mammalian cells in 
general 


ij/Apcriiiicniciiiy Licicr milieu L/Dlujii 

immune response preferences in test 
mammals 


Phe 


TTC » TTT 


TTT > TTC 


Pro 


CCC » (CCG, CCA, CCT) 


CCC > CCT » (CCA, CCG) 


Ser 


AGC > TCC > (TCG, AGT, TCA, 
TCT) 


TCG » (TCT, TCA, TCC) » (AGC, 
AGT) 


i nr 


Arr »/ , Arr T a pa APT"! 


ACCr > ACC » AC A > APT 


Tyr 


TAC » TAT 


TAC > TAT 


Val 


GTG > GTC > (GTA, GTT) 


(GTG, GTC) > GTT > GTA 



[0089] As will be apparent from the above table: 

[0090] (i) several codons deemed by Seed to have a higher codon usage ranking in 
mammalian cells than at least one other synonymous codon have in fact a lower immune 

5 response preference ranking than the or each other synonymous codon (e.g. , Ala 000 has a 
higher codon usage ranking but lower immune response preference ranking than Ala GCT ; 
Gly GGC has a higher codon usage ranking but lower immune response preference ranking than 
Gly GGA ; Phe TTC has a higher codon usage ranking but lower immune response preference 
ranking than Phe 1 ^; Ser AGC has a higher codon usage ranking but lower immune response 

1 0 preference ranking than any one of Ser TCG Ser TCT , Ser TCG , Ser TCA and Ser TCC ; and Thr^ 0 has 
a higher codon usage ranking but lower immune response preference ranking than Thr ACG ); 

[0091] (ii) several codons deemed by Seed to have a lower codon usage ranking in 
mammalian cells than at least one other synonymous codon have in fact a higher immune 
response preference ranking than the or each other synonymous codon (e.g., Ala has a 

1 5 lower codon usage ranking but higher immune response preference ranking than Ala GCC ; 

Gly GGA has a lower codon usage ranking but higher immune response preference ranking than 
Gly GGC or Gly GGG ; Phe TTT has a lower codon usage ranking but higher immune response 
preference ranking than Phe" 0 ; Ser TCG has a lower codon usage ranking but higher immune 
response preference ranking than Ser AGC or Ser TCC ; Ser TCT and Ser TCA have a lower codon 

20 usage ranking but higher immune response preference ranking than Ser ; and Thr has a 
lower codon usage ranking but higher immune response preference ranking than Thr ACC ); 
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[0092] (iii) several codons deemed by Seed to have a higher codon usage ranking in 
mammalian cells than another synonymous codon have in fact the same immune response 
preference ranking as the other synonymous codon (e.g., Gln CAG has a higher codon usage 
ranking than, but the same immune response preference ranking as, Gln CAA ; His CAC has a 
5 higher codon usage ranking than, but the same immune response preference ranking as, 
His CAT ; Leu CTG has a higher codon usage ranking than, but the same immune response 
preference ranking as Leu CTC ; Lys^ 0 has a higher codon usage ranking than, but the same 
immune response preference ranking as, Lys AAA ; Val GTG has a higher codon usage ranking 
than, but the same immune response preference ranking as, Val GTC ); and 

10 [0093] (iv) several codons deemed by Seed to have the same codon usage ranking 

in mammalian cells as at least one other synonymous codon have in fact a different immune 
response preference ranking than the or each other synonymous codon (e.g., Ala GCT has the 
same codon usage ranking as, but a higher immune response preference ranking than, Ala GCA 
and Ala GCG ; Arg CGA Arg CGT and Arg AGA have the same codon usage ranking as, but a higher 

15 immune response preference ranking than, Arg AGG and Arg CGG ; Glu GAA has the same codon 
usage ranking as, but a higher immune response preference ranking than, Glu ; Gly has 
the same codon usage ranking as, but a higher immune response preference ranking than, 
Gly GGT ; Leu CTA and Leu CTT have the same codon usage ranking as, but a higher immune 
response preference ranking than, Leu TTG and Leu TTA ; Pro CCT has the same codon usage 

20 ranking as, but a higher immune response preference ranking than, Pro CCA or Pro CCG ; Ser TCG 
has the same codon usage ranking as, but a higher immune response preference ranking than, 
any one of Ser TCT , Ser TCA and Ser AGT ; Ser TCT and Ser TCA have the same codon usage ranking 
as, but a higher immune response preference ranking than, Ser AGT ; Thr ACG has the same codon 
usage ranking as, but a higher immune response preference ranking than, any one of Thr ACA 

25 and Thr ACT ; Thr ACG has the same codon usage ranking as, but a higher immune response 
preference ranking than, Thr ACT ; Val GTT has the same codon usage ranking as, but a higher 
immune response preference ranking than, Val GTA ). 

[0094] Accordingly, the present invention enables for the first time the modulation 
of an immune response to a target antigen in a mammal from a polynucleotide that encodes a 
30 polypeptide that corresponds to at least a portion of the target antigen by replacing at least one 
codon of the polynucleotide with a synonymous codon that has a higher or lower preference 
for producing an immune response than the codon it replaces. In some embodiments, 
therefore, the present invention embraces methods of constructing a synthetic polynucleotide 
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from which a polypeptide is producible to confer an enhanced or stronger immune response 
than one conferred by a parent polynucleotide that encodes the same polypeptide. These 
methods generally comprise selecting from TABLE 1 a codon (often referred to herein 
arbitrarily as a "first codon") of the parent polynucleotide for replacement with a synonymous 
5 codon, wherein the synonymous codon is selected on the basis that it exhibits a higher 
immune response preference than the first codon and replacing the first codon with the 
synonymous codon to construct the synthetic polynucleotide. Illustrative selections of the first 
and synonymous codons are made according to TABLE 2. 

[0095] In some embodiments, the selection of the first and synonymous codons is 
10 made according to TABLE 3, which is the same as TABLE 2 with the exception that it 
excludes selections based on codon usage rankings as disclosed by Seed. In illustrative 
examples of this type, the selection of a second codon (and subsequent codons if desired) for 
replacement with a synonymous codon is made according to TABLE 4. 

[0096] Where synonymous codons are classified into three ranks ('high', 
15 'intermediate' and 'low 5 ranks) based on their immune response preference ranking (e.g. , the 
synonymous codons for Ala, He, Leu, Pro, Ser, Thr and Val), it is preferred that the 
synonymous codon that is selected is a high rank codon when the first codon is a low rank 
codon. However, this is not essential and the synonymous codon can be selected from 
intermediate rank codons. In the case of two or more synonymous codons having similar 
20 immune response preferences, it will be appreciated that any one of these codons can be used 
to replace the first codon. 

[0097] In other embodiments, the invention provides methods of constructing a 
synthetic polynucleotide from which a polypeptide is producible to confer a reduced or 
weaker immune response than one conferred by a parent polynucleotide that encodes the same 

25 polypeptide. These methods generally comprise selecting from TABLE 1 a first codon of the 
parent polynucleotide for replacement with a synonymous codon, wherein the synonymous 
codon is selected on the basis that it exhibits a lower immune response preference than the 
first codon and replacing the first codon with the synonymous codon to construct the synthetic 
polynucleotide. Illustrative selections of the first and synonymous codons are made according 

30 to TABLE 5. 

[0098] In some embodiments, the selection of the first and synonymous codons is 

made according to TABLE 6, which is the same as TABLE 5 with the exception that it 

excludes selections based on codon usage rankings as disclosed by Seed. In illustrative 
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examples of this type, the selection of a second codon (and subsequent codons if desired) for 
replacement with a synonymous codon is made according to TABLE 7. 

[0099] Where synonymous codons are classified into the three ranks noted above, it 
is preferred that the synonymous codon that is selected is a low rank codon when the first 
5 codon is a high rank codon but this is not essential and thus the synonymous codon can be 
selected from intermediate rank codons if desired. 

[0100] Generally, the difference in strength of the immune response produced in 
the mammal from the synthetic polynucleotide as compared to that produced from the parent 
polynucleotide depends on the number of first/second codons that are replaced by 

10 synonymous codons, and on the difference in immune response preference ranking between 
the first/second codons and the synonymous codons. Put another way, the fewer such 
replacements, and/or the smaller the difference in immune response preference ranking 
between the synonymous and first/codons codons, the smaller the difference will be in the 
immune response produced by the synthetic polynucleotide and the one produced by the 

15 parent polynucleotide. Conversely, the more such replacements, and/or the greater the 

difference in immune response preference ranking between the synonymous and first/second 
codons, the greater the difference will be in the immune response produced by the synthetic 
polynucleotide and the one produced by the parent polynucleotide. 

[0101] It is preferable but not necessary to replace all the codons of the parent 
20 polynucleotide with synonymous codons having different (e.g., higher or lower) immune 

response preference rankings than the first/second codons. Changes in the conferred immune 
response can be accomplished even with partial replacement. Generally, the replacement step 
affects at least about 5%, 10%, 15%, 20%, 25%, 30%, usually at least about 35%, 40%, 50%, 
and typically at least about 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or more of the 
25 first/second codons of the parent polynucleotide. In embodiments in which a stronger or 

enhanced immune response is required, it is generally desirable to replace some, preferably 
most and more preferably all, low rank codons in a parent polynucleotide with synonymous 
codons that are intermediate, or preferably high rank codons. Typically, replacement of low 
with intennediate or high rank codons will result in an increase in the strength of immune 
30 response from the synthetic polynucleotide so constructed, as compared to the one produced 
from the parent polynucleotide under the same conditions. However, it is often desirable to 
replace some, preferably most and more preferably all, intermediate rank codons in the parent 
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polynucleotide with high rank codons, if stronger or more enhanced immune responses are 
desired. 

[0102] By contrast, in some embodiments in which a weaker or reduced immune 
response is required, it is generally desirable to replace some, preferably most and more 
5 preferably all, high rank codons in a parent polynucleotide with synonymous codons that are 
intermediate, or preferably low rank codons. Typically, replacement of high with intermediate 
or low rank codons will result in a substantial decrease in the strength of immune response 
from the synthetic polynucleotide so constructed, as compared to the one produced from the 
parent polynucleotide under the same condition. In specific embodiments in which it is 
1 0 desired to confer a weaker or more reduced immune response, it is generally desirable to 

replace some, preferably most and more preferably all, intermediate rank codons in the parent 
polynucleotide with low rank codons. 

[0103] In illustrative examples requiring a stronger or enhanced immune response, 
the number of, and difference in immune response preference ranking between, the 

15 first/second codons and the synonymous codons are selected such that the immune response 
conferred by the synthetic polynucleotide is at least about 1 10%, 150%, 200%, 250%, 300%, 
350%, 400%, 450%, 500%, 600%, 700%, 800%, 900%, 1000%, or more, of the immune 
response conferred by the parent polynucleotide under the same conditions. Conversely, in 
some embodiments requiring a lower or weaker immune response, the number of, and 

20 difference in phenotypic preference ranking between, the first/second codons and the 

synonymous codons are selected such that the immune response conferred by the synthetic 
polynucleotide is no more than about 90%, 80%, 70%, 60%, 50%, 40%, 30%, 20%, 10%, 5%, 
or less of the immune response conferred by the parent polynucleotide under the same 
conditions. 

25 4. Modulating immune responses in mammals by expression of isoaccepting transfer 
RNA-encoding polynucleotides 

[0104] It is possible to take advantage of the immune response preference rankings 
of codons discussed in Section 3 to modulate an immune response to a target antigen by 
changing the level of iso-tRNAs in the cell population which is the target of the 
30 immunization. Accordingly, the invention also features methods of enhancing the quality of 
an immune response to a target antigen in a mammal, wherein the response is conferred by the 
expression of a first polynucleotide that encodes a polypeptide corresponding to at least a 
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portion of the target antigen. These methods generally comprise: introducing into the mammal 
a first nucleic acid construct comprising the first polynucleotide in operable connection with a 
regulatory polynucleotide. A second nucleic acid construct is then introduced into the 
mammal, which comprises a second polynucleotide that is operably connected to a regulatory 
5 polynucleotide and that encodes an iso-tRNA corresponding to a low immune preference 
codon of the first polynucleotide. 

[0105] In practice, therefore, an iso-tRNA is introduced into the mammal by the 
second nucleic acid construct when the iso-tRNA corresponds to a low immune response 
preference codon in the first polynucleotide, which are suitably selected from the group 

10 consisting of Ala GCA , Ala GCG , Ala GCC , Arg AGG Arg CGG Asn^ 1 , Asp GAT , Cys TGT , Glu GAG 
Gly GGG Gly GGT , Gly GGC , Ile ATA , Ile ATT , Leu TTG Leu™, Leu CTA , Leu CTT , Phe TTC , Pro CCA , 
Pro CCG Pro CCT , So** Ser AGT , Ser TCT , Ser TCA , Ser TCC , Thr ACA , Thr ACT , Tyr TAT , Val GTA and 
Val GTT . In specific embodiments, the supplied iso-tRNAs are specific for codons that have 
6 low' immune response preference codons, which may be selected from the group consisting 

15 of Ala GCA , Ala GCG Arg AGG Arg CGG Asn^ 7 , Asp GAT , Cys TGT , Glu GAG Gly GGG Gly GGT , 

Gly GGC , Ile ATA , Leu TTG Leu TTA , Phe TTC , Pro CCA , Pro CCG Ser^ 0 , So* 0 *, Thr ACT , Tyr TAT and 
VaI GTA . The first construct (ie., antigen-expressing construct) and the second construct (i.e., 
the iso-tRNA-expressing construct) may be introduced simultaneously or sequentially (in 
either order) and may be introduced at the same or different sites. In some embodiments, the 

20 first and second constructs are contained in separate vectors. In other embodiments, they are 
contained in a single vector. If desired, two or more second constructs may be introduced each 
expressing a different iso-tRNA corresponding to a low preference codon of the first 
polynucleotide. The first and second nucleic acid constructs may be constructed and 
administered concurrently or contemporaneously to a mammal according to any suitable 

25 method, illustrative examples of which are discussed below for the chimeric constructs of the 
invention. 

[0106] In some embodiments, a plurality of different iso-tRNA-expressing 
constructs (e.g., 2, 3, 4, 5, 6,7 ,8 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more) are 
administered concurrently or contemporaneously with the antigen-expressing construct, 
30 wherein individual iso-tRNA-expressing constructs express a different iso-tRNA than other 
iso-tRNA-expressing constructs. 
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5. Antigens 

[0107] Target antigens useful in the present invention are typically proteinaceous 
molecules, representative examples of which include polypeptides and peptides. Target 
antigens may be selected from endogenous antigens produced by a host or exogenous antigens 
5 that are foreign to the host. Suitable endogenous antigens include, but are not restricted to, 
cancer or tumor antigens. Non-limiting examples of cancer or tumor antigens include antigens 
from a cancer or tumor selected from ABL1 proto-oncogene, AIDS related cancers, acoustic 
neuroma, acute lymphocytic leukemia, acute myeloid leukemia, adenocystic carcinoma, 
adrenocortical cancer, agnogenic myeloid metaplasia, alopecia, alveolar soft-part sarcoma, 

10 anal cancer, angiosarcoma, aplastic anemia, astrocytoma, ataxia-telangiectasia, basal cell 
carcinoma (skin), bladder cancer, bone cancers, bowel cancer, brain stem glioma, brain and 
CNS tumors, breast cancer, CNS tumors, carcinoid tumors, cervical cancer, childhood brain 
tumors, childhood cancer, childhood leukemia, childhood soft tissue sarcoma, 
chondrosarcoma, choriocarcinoma, chronic lymphocytic leukemia, chronic myeloid leukemia, 

15 colorectal cancers, cutaneous T-cell lymphoma, dermatofibrosarcoma protuberans, 

desmoplastic small round cell tumor, ductal carcinoma, endocrine cancers, endometrial 
cancer, ependymoma, oesophageal cancer, E wing's Sarcoma, Extra-Hepatic Bile Duct 
Cancer, Eye Cancer, Eye: Melanoma, Retinoblastoma, Fallopian Tube cancer, Fanconi 
anemia, fibrosarcoma, gall bladder cancer, gastric cancer, gastrointestinal cancers, 

20 gastrointestinal-carcinoid-tumor, genitourinary cancers, germ cell tumors, gestational- 

trophoblastic-disease, glioma, gynecological cancers, haematological malignancies, hairy cell 
leukemia, head and neck cancer, hepatocellular cancer, hereditary breast cancer, histiocytosis, 
Hodgkin's disease, human papillomavirus, hydatidiform mole, hypercalcemia, hypopharynx 
cancer, intraocular melanoma, islet cell cancer, Kaposi's sarcoma, kidney cancer, Langerhans 

25 cell histiocytosis, laryngeal cancer, leiomyosarcoma, leukemia, Li-Fraumeni syndrome, lip 
cancer, liposarcoma, liver cancer, lung cancer, lymphedema, lymphoma, Hodgkin's 
lymphoma, non-Hodgkin's lymphoma, male breast cancer, malignant-rhabdoid tumor of 
kidney, medulloblastoma, melanoma, Merkel cell cancer, mesothelioma, metastatic cancer, 
mouth cancer, multiple endocrine neoplasia, mycosis fungoides, myelodysplastic syndromes, 

30 myeloma, myeloproliferative disorders, nasal cancer, nasopharyngeal cancer, nephroblastoma, 

neuroblastoma, neurofibromatosis, Nijmegen breakage syndrome, non-melanoma skin cancer, 

non-small-cell-lung-cancer (NSCLC), ocular cancers, esophageal cancer, oral cavity cancer, 

oropharynx cancer, osteosarcoma, ostomy ovarian cancer, pancreas cancer, paranasal cancer, 

parathyroid cancer, parotid gland cancer, penile cancer, peripheral-neuroectodermal tumours, 
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pituitary cancer, polycythemia vera, prostate cancer, rare cancers and associated disorders, 
renal cell carcinoma, retinoblastoma, rhabdomyosarcoma, Rothmund-Thomson syndrome, 
salivary gland cancer, sarcoma, schwannoma, Sezary syndrome, skin cancer, small cell lung 
cancer (SCLC), small intestine cancer, soft tissue sarcoma, spinal cord tumors, squamous- 
5 cell-carcinoma-(skin), stomach cancer, synovial sarcoma, testicular cancer, thymus cancer, 
thyroid cancer, transitional-cell-cancer-(bladder), transitional-cell-cancer-(renal-pelvis-/- 
ureter), trophoblastic cancer, urethral cancer, urinary system cancer, uroplakins, uterine 
sarcoma, uterus cancer, vaginal cancer, vulva cancer, Waldenstroms macroglobulinemia, 
Wilms 5 tumor. In certain embodiments, the cancer or tumor relates to melanoma. Illustrative 

10 examples of melanoma-related antigens include melanocyte differentiation antigen (e.g., 
gplOO, MART, Melan- A/MART- 1 , TRP-1, Tyros, TRP2, MC1R, MUC1F, MUC1R or a 
combination thereof) and melanoma-specific antigens (e.g., BAGE, GAGE-1, gpl00In4, 
MAGE-1 (e.g., GenBank Accession No. X54156 and AA49431 1), MAGE-3, MAGE4, 
PRAME, TRP2IN2, NYNSOla, NYNSOlb, LAGE1, p97 melanoma antigen (e.g., GenBank 

15 Accession No. M12154) p5 protein, gp75, oncofetal antigen, GM2 and GD2 gangliosides, 

cdc27, p21ras, gpl00 Pmel117 or a combination thereof. Other tumour-specific antigens include, 
but are not limited to; etv6, amll, cyclophilin b (acute lymphoblastic leukemia); Ig-idiotype 
(B cell lymphoma); E-cadherin, a-catenin, P-catenin, y -catenin, pl20ctn (glioma); p21ras 
(bladder cancer); p21ras (biliary cancer); MUG family, HER2/neu, c-erbB-2 (breast cancer); 

20 p53, p21ras (cervical carcinoma); p21ras, HER2/neu, c-erbB-2, MUG family, Cripto-1 protein, 
Pim-1 protein (colon carcinoma); Colorectal associated antigen (CRC)-C017~1A/GA733, 
APC (colorectal cancer); carcinoembryonic antigen (CEA) (colorectal cancer; 
choriocarcinoma); cyclophilin b (epithelial cell cancer); HER2/neu, c-erbB-2, ga733 
glycoprotein (gastric cancer); a-fetoprotein (hepatocellular cancer); Imp-1, EBNA-1 

25 (Hodgkin's lymphoma); CEA, MAGE-3, NY-ESO-l (lung cancer); cyclophilin b (lymphoid 
cell-derived leukemia); MUC family, p21ras (myeloma); HER2/neu, c-erbB-2 (non-small cell 
lung carcinoma); Imp-1, EBNA-1 (nasopharyngeal cancer); MUC family, HER2/neu, c-erbB- 
2, MAGE-A4, NY-ESO-l (ovarian cancer); Prostate Specific Antigen (PSA) and its antigenic 
epitopes PSA-1, PSA-2, and PSA-3, PSMA, HER2/neu, c-erbB-2, ga733 glycoprotein 

30 (prostate cancer); HER2/neu, c-erbB-2 (renal cancer); viral products such as human 

papillomavirus proteins (squamous cell cancers of the cervix and esophagus); NY-ESO-l 
(testicular cancer); and HTLV-1 epitopes (T cell leukemia). 
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[0108] Foreign or exogenous antigens are suitably selected from antigens of 
pathogenic organisms. Exemplary pathogenic organisms include, but are not limited to, 
viruses, bacteria, fungi parasites, algae and protozoa and amoebae. Illustrative viruses include 
viruses responsible for diseases including, but not limited to, measles, mumps, rubella, 
5 poliomyelitis, hepatitis A, B (e.g., GenBank Accession No. E02707), and C (e.g., GenBank 
Accession No. E06890), as well as other hepatitis viruses, influenza, adenovirus (e.g., types 
4 and 7), rabies (e.g., GenBank Accession No. M34678), yellow fever, Epstein-Barr virus 
and other herpesviruses such as papillomavirus, Ebola virus, influenza virus, Japanese 
encephalitis (e.g., GenBank Accession No. E07883), dengue (e.g., GenBank Accession No. 

10 M24444), hantavirus, Sendai virus, respiratory syncytial virus, orthomyxoviruses, vesicular 
stomatitis virus, visna virus, cytomegalovirus and human immunodeficiency virus (HIV) 
(e.g., GenBank Accession No. U 18552). Any suitable antigen derived from such viruses are 
useful in the practice of the present invention. For example, illustrative retroviral antigens 
derived from HIV include, but are not limited to, antigens such as gene products of the gag, 

15 pol, and env genes, the Nef protein, reverse transcriptase, and other HIV components. 

Illustrative examples of hepatitis viral antigens include, but are not limited to, antigens such 
as the S, and L proteins of hepatitis B virus, the pre-S antigen of hepatitis B virus, and 
other hepatitis, e.g., hepatitis A, B, and C, viral components such as hepatitis C viral RNA. 
Illustrative examples of influenza viral antigens include; but are not limited to, antigens 

20 such as hemagglutinin and neuraminidase and other influenza viral components. Illustrative 
examples of measles viral antigens include, but are not limited to, antigens such as the 
measles virus fusion protein and other measles virus components. Illustrative examples of 
rubella viral antigens include, but are not limited to, antigens such as proteins El and E2 and 
other rubella virus components; rotaviral antigens such as VP7sc and other rotaviral 

25 components. Illustrative examples of cytomegaloviral antigens include, but are not limited 
to, antigens such as envelope glycoprotein B and other cytomegaloviral antigen 
components. Non-limiting examples of respiratory syncytial viral antigens include antigens 
such as the RSV fusion protein, the M2 protein and other respiratory syncytial viral antigen 
components. Illustrative examples of herpes simplex viral antigens include, but are not 

30 limited to, antigens such as immediate early proteins, glycoprotein D, and other herpes 

simplex viral antigen components. Non-limiting examples of varicella zoster viral antigens 
include antigens such as 9PI, gpll, and other varicella zoster viral antigen components. Non- 
limiting examples of Japanese encephalitis viral antigens include antigens such as proteins 
E, M-E, M-E-NS 1, NS 1, NS 1-NS2A, 80%E, and other Japanese encephalitis viral antigen 
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components. Representative examples of rabies viral antigens include, but are not limited to, 
antigens such as rabies glycoprotein, rabies nucleoprotein and other rabies viral antigen 
components. Illustrative examples of papillomavirus antigens include, but are not limited to, 
the LI and L2 capsid proteins as well as the E6/E7 antigens associated with cervical 
5 cancers, See Fundamental Virology, Second Edition, eds. Fields, B.N. and Knipe, D.M., 
1991, Raven Press, New York, for additional examples of viral antigens. 

[0109] Illustrative examples of fungi include Acremonium spp., Aspergillus spp., 
Basidiobolus spp., Bipolaris spp., Blastomyces dermatidis, Candida spp., Cladophialophora 
carrionii, Coccidioides immitis, Conidiobolus spp., Cryptococcus spp., Curvularia spp., 

10 Epidermophyton spp., Exophiala jeanselmei, Exserohilum spp., Fonsecaea compacta, 
Fonsecaea pedrosoi, Fusarium oxysporum, Fusarium solani, Geotrichum candidum, 
Histoplasma capsulatum var. capsulatum, Histoplasma capsulatum var. duboisii, Hortaea 
werneckii, Lacazia loboi, Lasiodiplodia theobromae, Leptosphaeria senegalensis, Madurella 
grisea, Madurella mycetomatis, Malassezia furfur, Microsporum spp., Neotestudina rosatii, 

15 Onychocola canadensis , Paracoccidioides brasiliensis, Phialophora verrucosa, Piedraia 
hortae, Piedra iahortae, Pityriasis versicolor, Pseudallescheria boydii, Pyrenochaeta 
romeroi, Rhizopus arrhizus, Scopulariopsis brevicaulis, Scytalidium dimidiatum, Sporothrix 
schenckii, Trichophyton spp., Trichosporon spp., Zygomycete fungi, Absidia corymbifera, 
Rhizomucor pusillus and Rhizopus arrhizus. Thus, representative fungal antigens that can be 

20 used in the compositions and methods of the present invention include, but are not limited to, 
Candida fungal antigen components; histoplasma fungal antigens such as heat shock protein 
60 (HSP60) and other histoplasma fungal antigen components; cryptococcal fungal antigens 
such as capsular polysaccharides and other cryptococcal fungal antigen components; 
coccidioides fungal antigens such as spherule antigens and other coccidioides fungal antigen 

25 components; and tinea fungal antigens such as trichophytin and other coccidioides fungal 
antigen components. 

[0110] Illustrative examples of bacteria include bacteria that are responsible for 
diseases including, but not restricted to, diphtheria (e.g., Corynebacterium diphtheria), 
pertussis {e.g., Bordetella pertussis, GenBank Accession No. M35274), tetanus (e.g., 
30 Clostridium tetani, GenBank Accession No. M64353), tuberculosis (e.g. , Mycobacterium 
tuberculosis), bacterial pneumonias (e.g., Haemophilus influenzae.), cholera (e.g., Vibrio 
cholerae), anthrax (e.g., Bacillus anthracis), typhoid, plague, shigellosis (e.g., Shigella 
dysenteriae), botulism (e.g., Clostridium botulinum), salmonellosis (e.g., GenBank 
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Accession No. L03833), peptic ulcers (e.g., Helicobacter pylori), Legionnaire's Disease, 
Lyme disease (e.g., GenBank Accession No. U59487), Other pathogenic bacteria include 
Escherichia coli, Clostridium perfringens, Pseudomonas aeruginosa, Staphylococcus aureus 
and Streptococcus pyogenes. Thus, bacterial antigens which can be used in the compositions 
5 and methods of the invention include, but are not limited to: pertussis bacterial antigens 
such as pertussis toxin, filamentous hemagglutinin, pertactin, F M2, FIM3, adenylate 
cyclase and other pertussis bacterial antigen components; diphtheria bacterial antigens such 
as diphtheria toxin or toxoid and other diphtheria bacterial antigen components; tetanus 
bacterial antigens such as tetanus toxin or toxoid and other tetanus bacterial antigen 

10 components, streptococcal bacterial antigens such as M proteins and other streptococcal 
bacterial antigen components; gram-negative bacilli bacterial antigens such as 
lipopolysaccharides and other gram-negative bacterial antigen components; Mycobacterium 
tuberculosis bacterial antigens such as mycolic acid, heat shock protein 65 (HSP65), the 
30kDa major secreted protein, antigen 85 A and other mycobacterial antigen components; 

15 Helicobacter pylori bacterial antigen components, pneumococcal bacterial antigens such as 
pneumolysin, pneumococcal capsular polysaccharides and other pneumococcal bacterial 
antigen components; Haemophilus influenza bacterial antigens such as capsular 
polysaccharides and other Haemophilus influenza bacterial antigen components; anthrax 
bacterial antigens such as anthrax protective antigen and other anthrax bacterial antigen 

20 components; rickettsiae bacterial antigens such as rompA and other rickettsiae bacterial 

antigen component. Also included with the bacterial antigens described herein are any other 
bacterial, mycobacterial, mycoplasmal, rickettsial, or chlamydial antigens. 

[0111] Illustrative examples of protozoa include protozoa that are responsible for 
diseases including, but not limited to, malaria (e.g., GenBank Accession No. X53832), 

25 hookworm, onchocerciasis (e.g., GenBank Accession No. M27807), schistosomiasis (e.g., 

GenBank Accession No. LOS 198), toxoplasmosis, trypanosomiasis, leishmaniasis, giardiasis 
(GenBank Accession No. M33641), amoebiasis, filariasis (e.g., GenBank Accession No. 
J03266), borreliosis, and trichinosis. Thus, protozoal antigens which can be used in the 
compositions and methods of the invention include, but are not limited to: Plasmodium 

30 falciparum antigens such as merozoite surface antigens, sporozoite surface antigens, 

circumsporozoite antigens, gametocyte/ gamete surface antigens, blood-stage antigen pf 

155/RESA and other plasmodial antigen components; toxoplasma antigens such as SAG-1, 

p30 and other toxoplasma antigen components; schistosoma antigens such as glutathione- S- 

transferase, paramyosin, and other schistosomal antigen components; leishmania major and 
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other leishmaniae antigens such as gp63, lipophosphoglycan and its associated protein and 
other leishmanial antigen components; and trypanosoma cruzi antigens such as the 75-77kDa 
antigen, the 56kDa antigen and other trypanosomal antigen components, 

[0112] The present invention also contemplates toxin components as antigens, 
5 illustrative examples of which include staphylococcal enterotoxins, toxic shock syndrome 
toxin; retroviral antigens (e.g., antigens derived from HIV), streptococcal antigens, 
staphylococcal enterotoxin-A (SEA), staphylococcal enterotoxin-B (SEB), staphylococcal 
enterotoxini-3 (SE1-3), staphylococcal enterotoxin-D (SED), staphylococcal enterotoxin-E 
(SEE) as well as toxins derived from mycoplasma, mycobacterium, and herpes viruses. 

10 6. Construction of synthetic polynucleotides 

[0113] Replacement of one codon for another can be achieved using standard 
methods known in the art. For example codon modification of a parent polynucleotide can be 
effected using several known mutagenesis techniques including, for example, oligonucleotide- 
directed mutagenesis, mutagenesis with degenerate oligonucleotides, and region-specific 

15 mutagenesis. Exemplary in vitro mutagenesis techniques are described for example in U.S. 
Patent Nos. 4,1 84,917, 4,321,365 and 4,351,901 or in the relevant sections of Ausubel, et al 
(CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, Inc. 1997) 
and of Sambrook, et al, (MOLECULAR CLONING. A LABORATORY MANUAL, Cold 
Spring Harbor Press, 1989). Instead of in vitro mutagenesis, the synthetic polynucleotide can 

20 be synthesized de novo using readily available machinery as described, for example, in U.S. 
Patent No 4,293,652. However, it should be noted that the present invention is not dependent 
on, and not directed to, any one particular technique for constructing the synthetic 
polynucleotide. 

[0114] The parent polynucleotide is suitably a natural gene. However, it is possible 
25 that the parent polynucleotide is not naturally-occurring but has been engineered using 

recombinant techniques. Parent polynucleotides can be obtained from any suitable source, 
such as from eukaryotic or prokaryotic organisms, including but not limited to mammals or 
other animals, and pathogenic organisms such as yeasts, bacteria, protozoa and viruses. 

[0115] The invention also contemplates synthetic polynucleotides encoding one or 
30 more desired portions of a target antigen. In some embodiments, the synthetic polynucleotide 
encodes at least about 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
30, 40, 50, 60, 70, 80, 90, 100, 120, 150, 300, 400, 500, 600, 700, 800, 900 or 1000, or even at 
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least about 2000, 3000, 4000 or 5000 contiguous amino acid residues, or almost up to the total 
number of amino acids present in a full-length target antigen. In some embodiments, the 
synthetic polynucleotide encodes a plurality of portions of the target antigen, wherein the 
portions are the same or different. In illustrative examples of this type, the synthetic 
5 polynucleotide encodes a multi-epitope fusion protein. A number of factors can influence the 
choice of portion size. For example, the size of individual portions encoded by the synthetic 
polynucleotide can be chosen such that it includes, or corresponds to the size of, T cell 
epitopes and/or B cell epitopes, and their processing requirements. Practitioners in the art will 
recognize that class I-restricted T cell epitopes are typically between 8 and 10 amino acid 

10 residues in length and if placed next to unnatural flanking residues, such epitopes can 

generally require 2 to 3 natural flaiiking amino acid residues to ensure that they are efficiently 
processed and presented. Class II-restricted T cell epitopes usually range between 12 and 25 
amino acid residues in length and may not require natural flanking residues for efficient 
proteolytic processing although it is believed that natural flanking residues may play a role. 

15 Another important feature of class II-restricted epitopes is that they generally contain a core 
of 9-10 amino acid residues in the middle which bind specifically to class II MHC molecules 
with flanking sequences either side of this core stabilizing binding by associating with 
conserved structures on either side of class II MHC antigens in a sequence independent 
manner. Thus the functional region of class II-restricted epitopes is typically less than about 

20 15 amino acid residues long. The size of linear B cell epitopes and the factors effecting their 
processing, like class II-restricted epitopes, are quite variable although such epitopes are 
frequently smaller in size than 15 amino acid residues. From the foregoing, it is advantageous, 
but not essential, that the size of individual portions of the target antigen is at least 6, 7, 8, 9, 
10, 1 1, 12, 13, 14, 15, 20, 25, 30 amino acid residues. Suitably, the size of individual portions 

25 is no more than about 500, 200, 100, 80, 60, 50, 40 amino acid residues. In certain 

advantageous embodiments, the size of individual portions is sufficient for presentation by an 
antigen-presenting cell of a T cell and/or a B cell epitope contained within the peptide. 

[0116] As will be appreciated by those of skill in the art, it is generally not 

necessary to immunize with a polypeptide that shares exactly the same amino acid sequence 

30 with the target antigen to produce an immune response to that antigen. In some embodiments, 

therefore, the polypeptide encoded by the synthetic polynucleotide is desirably a variant of at 

least a portion of the target antigen. "Variant" polypeptides include proteins derived from the 

target antigen by deletion (so-called truncation) or addition of one or more amino acids to the 

N-terminal and/or C -terminal end of the target antigen; deletion or addition of one or more 
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amino acids at one or more sites in the target antigen; or substitution of one or more amino 
acids at one or more sites in the target antigen. Variant polypeptides encompassed by the 
present invention will have at least 40%, 50%, 60%, 70%, generally at least 75%, 80%, 85%, 
typically at least about 90% to 95% or more, and more typically at least about 96%, 97%, 
5 98%, 99% or more sequence similarity or identity with the amino acid sequence of the target 
antigen or portion thereof as determined by sequence alignment programs described 
elsewhere herein using default parameters. A variant of a target antigen may differ from that 
antigen generally by as much 1000, 500, 400, 300, 200, 100, 50 or 20 amino acid residues or 
suitably by as few as 1-15 amino acid residues, as few as 1-10, such as 6-10, as few as 5, as 
10 few as 4, 3, 2, or even 1 amino acid residue. 

[0117] Variant polypeptides corresponding to at least a portion of a target antigen 
may contain conservative amino acid substitutions at various locations along their sequence, 
as compared to the target antigen amino acid sequence. A "conservative amino acid 
substitution" is one in which the amino acid residue is replaced with an amino acid residue 
15 having a similar side chain. Families of amino acid residues having similar side chains have 
been defined in the art, which can be generally sub-classified as follows: 

[0118] Acidic: The residue has a negative charge due to loss of H ion at 
physiological pH and the residue is attracted by aqueous solution so as to seek the surface 
positions in the conformation of a peptide in which it is contained when the peptide is in 
20 aqueous medium at physiological pH. Amino acids having an acidic side chain include 
glutamic acid and aspartic acid. 

[0119] Basic: The residue has a positive charge due to association with H ion at 
physiological pH or within one or two pH units thereof (e.g., histidine) and the residue is 
attracted by aqueous solution so as to seek the surface positions in the conformation of a 
25 peptide in which it is contained when the peptide is in aqueous medium at physiological pH. 
Amino acids having a basic side chain include arginine, lysine and histidine. 

[0120] Charged: The residues are charged at physiological pH and, therefore, 
include amino acids having acidic or basic side chains (i.e., glutamic acid, aspartic acid, 
arginine, lysine and histidine). 

30 [0121] Hydrophobic: The residues are not charged at physiological pH and the 

residue is repelled by aqueous solution so as to seek the inner positions in the conformation of 
a peptide in which it is contained when the peptide is in aqueous medium. Amino acids 
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having a hydrophobic side chain include tyrosine, valine, isoleucine, leucine, methionine, 
phenylalanine and tryptophan. 

[0122] Neutral/polar: The residues are not charged at physiological pH, but the 
residue is not sufficiently repelled by aqueous solutions so that it would seek inner positions 
5 in the conformation of a peptide in which it is contained when the peptide is in aqueous 
medium. Amino acids having a neutral/polar side chain include asparagine, glutamine, 
cysteine, histidine, serine and threonine. 

[0123] This description also characterizes certain amino acids as "small" since their 
side chains are not sufficiently large, even if polar groups are lacking, to confer 

10 hydrophobicity. With the exception of proline, "small" amino acids are those with four 

carbons or less when at least one polar group is on the side chain and three carbons or less 
when not. Amino acids having a small side chain include glycine, serine, alanine and 
threonine. The gene-encoded secondary amino acid proline is a special case due to its known 
effects on the secondary conformation of peptide chains. The structure of proline differs from 

15 all the other naturally-occurring amino acids in that its side chain is bonded to the nitrogen of 
the a-amino group, as well as the a-carbon. Several amino acid similarity matrices (e.g., 
PAM120 matrix and PAM250 matrix as disclosed for example by Dayhoff et at (1978) A 
model of evolutionary change in proteins. Matrices for determining distance relationships In 
M. O. Dayhoff, (ed.), Atlas of protein sequence and structure, Vol. 5, pp. 345-358, National 

20 Biomedical Research Foundation, Washington DC; and by Gonnet et al. , 1992, Science 
256(5062): 144301445), however, include proline in the same group as glycine, serine, 
alanine and threonine. Accordingly, for the purposes of the present invention, proline is 
classified as a "small" amino acid. 

[0124] The degree of attraction or repulsion required for classification as polar or 
25 nonpolar is arbitrary and, therefore, amino acids specifically contemplated by the invention 
have been classified as one or the other. Most amino acids not specifically named can be 
classified on the basis of known behavior. 

[0125] Amino acid residues can be further sub-classified as cyclic or noncyclic, and 

aromatic or nonaromatic, self-explanatory classifications with respect to the side-chain 

30 substituent groups of the residues, and as small or large. The residue is considered small if it 

contains a total of four carbon atoms or less, inclusive of the carboxyl carbon, provided an 

additional polar substituent is present; three or less if not. Small residues are, of course, 

always nonaromatic. Dependent on their structural properties, amino acid residues may fall in 
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two or more classes. For the naturally-occurring protein amino acids, sub-classification 
according to the this scheme is presented in the Table 10. 



TABLE 10 



Original Residue 


Exemplary Substitutions 


Ala 


Ser 


Arg 


Lys 


Asn 


Gin, His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro 


His 


Asn, Gin 


lie 


Leu, Val 


Leu 


He, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, He, 


Phe 


Met, Leu, Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp, Phe 


Val 


He, Leu 



5 [0126] Conservative amino acid substitution also includes groupings based on side 

chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, 
valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains 
is serine and threonine; a group of amino acids having amide-containing side chains is 
asparagine and glutamine; a group of amino acids having aromatic side chains is 
10 phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is 
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lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side 
chains is cysteine and methionine. For example, it is reasonable to expect that replacement of 
a leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, 
or a similar replacement of an amino acid with a structurally related amino acid will not have 
5 a major effect on the properties of the resulting variant polypeptide. Conservative 

substitutions are shown in Table 1 1 below under the heading of exemplary substitutions. 
More preferred substitutions are shown under the heading of preferred substitutions. Amino 
acid substitutions falling within the scope of the invention, are, in general, accomplished by 
selecting substitutions that do not differ significantly in their effect on maintaining (a) the 
10 structure of the peptide backbone in the area of the substitution, (b) the charge or 

hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. After the 
substitutions are introduced, the variants are screened for biological activity. 



TABLE 11 

Exemplary and Preferred Amino Acid Substitutions 



Original Residue 


Exemplary Substitutions 


Preferred 
Substitutions 


Ala 


Val, Leu, He 


Val 


Arg 


Lys, Gin, Asn 


Lys 


Asn 


Gin, His, Lys, Arg 


Gin 


Asp 


Glu 


Glu 


Cys 


Ser 


Ser 


Gin 


Asn, His, Lys, 


Asn 


Glu 


Asp, Lys 


Asp 


Gly 


Pro 


Pro 


His 


Asn, Gin, Lys, Arg 


Arg 


lie 


Leu, Val, Met, Ala, Phe, 
Norleu 


Leu 


Leu 


Norleu, lie, Val, Met, Ala, Phe 


He 


Lys 


Arg, Gin, Asn 


Arg 


Met 


Leii, He, Phe 


Leu 


Phe 


Leu; Val, lie, Ala 


Leu 



- 57 - 



SUBSTITUTE SHEET (RULE 26) RO/AU 



WO 2009/049350 



PCT/AU2008/001463 



Original Residue 


Exemplary Substitutions 


Preferred 
Substitutions 


Pro 


Gly 


Gly 


Ser 


Thr 


Thr 


Thr 


Ser 


Ser 


Trp 


Tyr 


Tyr 


Tyr 


Trp, Phe, Thr, Ser 


Phe 


Val 


He, Leu, Met, Phe, Ala, Norleu 


Leu 



[0127] Alternatively, similar amino acids for making conservative substitutions can 
be grouped into three categories based on the identity of the side chains. The first group 
includes glutamic acid, aspartic acid, arginine, lysine, histidine, which all have charged side 

5 chains; the second group includes glycine, serine, threonine, cysteine, tyrosine, glutamine, 
asparagine; and the third group includes leucine, isoleucine, valine, alanine, proline, 
phenylalanine, tryptophan, methionine, as described in Zubay, G., Biochemistry, third edition, 
Wm.C. Brown Publishers (1 993). 

[0128] The invention further contemplates a chimeric construct comprising a 

10 synthetic polynucleotide of the invention, which is operably linked to a regulatory 

polynucleotide. The regulatory polynucleotide suitably comprises transcriptional and/or 
translational control sequences, which will be compatible for expression in the organism of 
interest or in cells of that organism. Typically, the transcriptional and translational regulatory 
control sequences include, but are not limited to, a promoter sequence, a 5' non-coding 

1 5 region, a m-regulatory region such as a functional binding site for transcriptional regulatory 
protein or translational regulatory protein, an upstream open reading frame, ribosomal- 
binding sequences, transcriptional start site, translational start site, and/or nucleotide sequence 
which encodes a leader sequence, termination codon, translational stop site and a 3' non- 
translated region. Constitutive or inducible promoters as known in the art are contemplated by 

20 the invention. The promoters may be either naturally occurring promoters, or hybrid 

promoters that combine elements of more than one promoter. Promoter sequences 

contemplated by the present invention may be native to the organism of interest or may be 

derived from an alternative source, where the region is functional in the chosen organism. The 

choice of promoter will differ depending on the intended host or cell or tissue type. For 

25 example, promoters which could be used for expression in mammals include the 
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metallothionein promoter, which can be induced in response to heavy metals such as 
cadmium, the p-actin promoter as well as viral promoters such as the SV40 large T antigen 
promoter, human cytomegalovirus (CMV) immediate early (IE) promoter, Rous sarcoma 
virus LTR promoter, the mouse mammary tumor virus LTR promoter, the adenovirus major 
5 late promoter (Ad MLP), the herpes simplex virus promoter, and a HPV promoter, 

particularly the HPV upstream regulatory region (URR), among others. All these promoters 
are well described and readily available in the art. 

[0129] Enhancer elements may also be used herein to increase expression levels of 
the mammalian constructs. Examples include the SV40 early gene enhancer, as described for 
10 example in Dijkema et al (1985, EMBO J. 4:761), the enhancer/promoter derived from the 
long terminal repeat (LTR) of the Rous Sarcoma Virus, as described for example in Gorman 
et al, (1982, Proc. Natl. Acad. Sci. USA 79:6777) and elements derived from human CMV, 
as described for example in Boshart et al (1985, Cell 41 :521), such as elements included in 
the CMV intron A sequence. 

15 [0130] The chimeric construct may also comprise a 3' non-translated sequence. A 

3' non-translated sequence refers to that portion of a gene comprising a DNA segment that 
contains a polyadenylation signal and any other regulatory signals capable of effecting mRNA 
processing or gene expression. The polyadenylation signal is characterized by effecting the 
addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. Polyadenylation 

20 signals are commonly recognized by the presence of homology to the canonical form 5' 

AATAAA-3 5 although variations are not uncommon. The 3' non-translated regulatory DNA 
sequence preferably includes from about 50 to 1,000 nts and may contain transcriptional and 
translational termination sequences in addition to a polyadenylation signal and any other 
regulatory signals capable of effecting mRNA processing or gene expression. 

25 [0131] In some embodiments, the chimeric construct further contains a selectable 

marker gene to permit selection of cells containing the construct. Selection genes are well 
known in the art and will be compatible for expression in the cell of interest. 

[0132] It will be understood, however, that expression of protein-encoding 
polynucleotides in heterologous systems is now well known, and the present invention is not 
30 directed to or dependent on any particular vector, transcriptional control sequence or 

technique for expression of the polynucleotides. Rather, synthetic polynucleotides prepared 
according to the methods set forth herein may be introduced into a mammal in any suitable 



-59- 



WO 2009/049350 



PCT/AU2008/001463 



manner in the form of any suitable construct or vector, and the synthetic polynucleotides may 
be expressed with known transcription regulatory elements in any conventional manner. 

[0133] In addition, chimeric constructs can be constructed that include sequences 
coding for adjuvants. Particularly suitable are detoxified mutants of bacterial ADP- 
5 ribosylating toxins, for example, diphtheria toxin, pertussis toxin (PT), cholera toxin (CT), 
Escherichia coli heat-labile toxins (LT1 and LT2), Pseudomonas endotoxin A, Clostridium 
botulinum C2 and C3 toxins, as well as toxins from C. perfringens, C. spiriforma and C. 
difficile. In some embodiments, the chimeric constructs include coding sequences for 
detoxified mutants of E. coli heat-labile toxins, such as the LT-K63 and LT-R72 detoxified 

10 mutants, described in U.S. Pat. No. 6,818,222. In some embodiments, the adjuvant is a 

protein-destabilising element, which increases processing and presentation of the polypeptide 
that corresponds to at least a portion of the target antigen through the class I MHC pathway, 
thereby leading to enhanced cell-mediated immunity against the polypeptide. Illustrative 
protein-destabilising elements include intracellular protein degradation signals or degrons 

15 which may be selected without limitation from a destabilising amino acid at the amino- 

terminus of a polypeptide of interest, a PEST region or a ubiquitin. For example, the coding 
sequence for the polypeptide can be modified to include a destabilising amino acid at its 
ammo-terminus so that the protein so modified is subject to the N-end rule pathway as 
disclosed, for example, by Bachmair et al in U.S. Patent Serial No. 5,093,242 and by 

20 Varshavsky et al in U.S. Patent Serial No. 5,122,463. In some embodiments, the destabilising 
amino acid is selected from isoleucine and glutamic acid, especially from histidine tyrosine 
and glutamine, and more especially from aspartic acid, asparagine, phenylalanine, leucine, 
tryptophan and lysine. In certain embodiments, the destabilising amino acid is arginine. In 
some proteins, the amino-terminal end is obscured as a result of the protein's conformation 

25 (i.e., its tertiary or quaternary structure). In these cases, more extensive alteration of the 

amino -terminus may be necessary to make the protein subject to the N-end rule pathway. For 
example, where simple addition or replacement of the single amino-terminal residue is 
insufficient because of an inaccessible ammo-terminus, several amino acids (including lysine, 
the site of ubiquitin joining to substrate proteins) may be added to the original amino-terminus 

30 to increase the accessibility and/or segmental mobility of the engineered amino terminus. In 

some embodiments, a nucleic acid sequence encoding the amino-terminal region of the 

polypeptide can be modified to introduce a lysine residue in an appropriate context. This can 

be achieved most conveniently by employing DNA constructs encoding "universal 

destabilising segments". A universal destabilising segment comprises a nucleic acid construct 
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which encodes a polypeptide structure, preferably segmentally mobile, containing one or 
more lysine residues, the codons for lysine residues being positioned within the construct such 
that when the construct is inserted into the coding sequence of the protein-encoding synthetic 
polynucleotide, the lysine residues are sufficiently spatially proximate to the amino-terminus 
5 of the encoded protein to serve as the second determinant of the complete amino-terminal 
degradation signal. The insertion of such constructs into the 5 5 portion of a polypeptide- 
encoding synthetic polynucleotide would provide the encoded polypeptide with a lysine 
residue (or residues) in an appropriate context for destabilization. In other embodiments, the 
polypeptide is modified to contain a PEST region, which is rich in an amino acid selected 

10 from proline, glutamic acid, serine and threonine, which region is optionally flanked by amino 
acids comprising electropositive side chains. In this regard, it is known that amino acid 
sequences of proteins with intracellular half-lives less than about 2 hours contain one or more 
regions rich in proline (P), glutamic acid (E), serine (S), and threonine (T) as for example 
shown by Rogers et al. (1986, Science 234 (4774): 364-368). In still other embodiments, the 

15 polypeptide is conjugated to a ubiquitin or a biologically active fragment thereof, to produce a 
modified polypeptide whose rate of intracellular proteolytic degradation is increased, 
enhanced or otherwise elevated relative to the unmodified polypeptide. 

[0134] One or more adjuvant polypeptides may be co-expressed with an 'antigenic' 
polypeptide that corresponds to at least a portion of the target antigen. In certain 
20 embodiments, adjuvant and antigenic polypeptides may be co-expressed in the form of a 
fusion protein comprising one or more adjuvant polypeptides and one or more antigenic 
polypeptides. Alternatively, adjuvant and antigenic polypeptides may be co-expressed as 
separate proteins. 

[0135] Furthermore, chimeric constructs can be constructed that include chimeric 
25 antigen-coding gene sequences, encoding, e.g., multiple antigens/epitopes of interest, for 

example derived from a single or from more than one target antigen. In certain embodiments, 
multi-cistronic cassettes (e.g., bi-cistronic cassettes) can be constructed allowing expression 
of multiple adjuvants and/or antigenic polypeptides from a single mRNA using, for example, 
the EMCV IRES, or the like. In other embodiments, adjuvants and/or antigenic polypeptides 
30 can be encoded on separate coding sequences that are operably connected to independent 
transcription regulatory elements. 

[0136] In some embodiments, the chimeric constructs of the invention are in the 
form of expression vectors which are suitably selected from self-replicating extra- 
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chromosomal vectors (e.g., plasmids) and vectors that integrate into a host genome. In 
illustrative examples of this type, the expression vectors are viral vectors, such as simian virus 
40 (SV40) or bovine papilloma virus (BPV), which has the ability to replicate as extra- 
chromosomal elements (Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, Gluzman 
5 ed., 1982; Sarver et al, 1981, MoL Cell. Biol. 1:486). Viral vectors include retroviral 

(lentivirus), adeno-associated virus (see, e.g., Okada, 1996, Gene Ther. 3:957-964; Muzyczka, 
1994, J. Clin. Invst. 94:1351; U.S. Pat. Nos. 6,156,303; 6,143,548 5,952,221, describing AAV 
vectors; see also U.S. Pat. Nos. 6,004,799; 5,833,993), adenovirus (see, e.g., U.S. Pat. Nos. 
6,140,087; 6,136,594; 6,133,028; 6,120,764), reovirus, herpesvirus, rotavirus genomes etc., 
10 modified for introducing and directing expression of a polynucleotide or transgene in cells. 
Retroviral vectors can include those based upon murine leukemia virus (see, e.g., U.S. Pat. 
No. 6,132,731), gibbon ape leukemia virus (see, e.g., U.S. Pat. No. 6,033,905), simian 
immuno-deficiency virus, human immuno-deficiency virus (see, e.g., U.S. Pat. No. 
5,985,641), and combinations thereof. 

15 [0137] Vectors also include those that efficiently deliver genes to animal cells in 

vivo (e.g., stem cells) (see, e.g., U.S. Pat. Nos. 5,821,235 and 5,786,340; Croyle et al, 1998, 
Gene Ther. 5:645; Croyle etal, 1998, Pharm. Res. 15:1348; Croyle etal, 1998, Hum. Gene 
Ther. 9:561; Foreman et al, 1998, Hum. Gene Ther. 9:1313 ; Wirtz et al, 1999, Gut 44:800). 
Adenoviral and adeno-associated viral vectors suitable for in vivo delivery are described, for 

20 example, in U.S. Pat. Nos. 5,700,470, 5,731,172 and 5,604,090. Additional vectors suitable 
for in vivo delivery include herpes simplex virus vectors (see, e.g., U.S. Pat. No. 5,501,979), 
retroviral vectors (see, e.g., U.S. Pat. Nos. 5,624,820, 5,693,508 and 5,674,703; and 
WO92/05266 and W092/14829), bovine papilloma virus (BPV) vectors (see, e.g., U.S. Pat. 
No. 5,719,054), CMV-based vectors (see, e.g., U.S. Pat. No. 5,561,063) and parvovirus, 

25 rotavirus and Norwalk virus vectors. Lentiviral vectors are useful for infecting dividing as 
well as non-dividing cells (see, e.g., U.S. Pat. No. 6,013,516). 

[0138] Additional viral vectors which will find use for delivering the nucleic acid 
molecules encoding the antigens of interest include those derived from the pox family of 
viruses, including vaccinia virus and avian poxvirus. By way of example, vaccinia virus 
30 recombinants expressing the chimeric constructs can be constructed as follows. The antigen 
coding sequence is first inserted into an appropriate vector so that it is adjacent to a vaccinia 
promoter and flanking vaccinia DNA sequences, such as the sequence encoding thymidine 
kinase (TK). This vector is then used to transfect cells that are simultaneously infected with 
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vaccinia. Homologous recombination serves to insert the vaccinia promoter plus the gene 
encoding the coding sequences of interest into the viral genome. The resulting TK- 
recombinant can be selected by culturing the cells in the presence of 5-bromodeoxyuridine 
and picking viral plaques resistant thereto. 

5 [0139] Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 

can also be used to deliver the genes. Recombinant avipox viruses, expressing immunogens 
from mammalian pathogens, are known to confer protective immunity when administered to 
non-avian species. The use of an avipox vector is particularly desirable in human and other 
mammalian species since members of the avipox genus can only productively replicate in 
1 0 susceptible avian species and therefore are not infective in mammalian cells. Methods for 

producing recombinant avipoxviruses are known in the art and employ genetic recombination, 
as described above with, respect to the production of vaccinia viruses. See, e.g., WO 
91/12882; WO 89/03429; and WO 92/03545. 

[0140] Molecular conjugate vectors, such as the adenovirus chimeric vectors 
15 described in Michael et al., J. Biol. Chem. (1993) 268:6866-6869 and Wagner et al., Proc. 
Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery. 

[0141] Members of the Alphavirus genus, such as, but not limited to, vectors 
derived from the Sindbis virus (SIN), Semliki Forest virus (SFV), and Venezuelan Equine 
Encephalitis virus (VEE), will also find use as viral vectors for delivering the chimeric 

20 constructs of the present invention. For a description of Sindbis-virus derived vectors useful 
for the practice of the instant methods, see, Dubensky et al (1996, J. Virol. 70:508-519; and 
International Publication Nos. WO 95/07995, WO 96/17072); as well as, Dubensky, Jr., T. 
W., et al, U.S. Pat. No. 5,843,723, and Dubensky, Jr., T. W., U.S. Pat. No. 5,789,245. 
Exemplary vectors of this type are chimeric alphavirus vectors comprised of sequences 

25 derived from Sindbis virus and Venezuelan equine encephalitis virus. See, e.g., Perri et al 
(2003, J. Virol. 77: 10394-10403) and International Publication Nos. WO 02/099035, WO 
02/080982, WO 01/81609, and WO 00/61772. 

[0142] In other illustrative embodiments, lentiviral vectors are employed to deliver 

a chimeric construct of the inevntion into selected cells or tissues. Typically, these vectors 

30 comprise a 5' lentiviral LTR, a tRNA binding site, a packaging signal, a promoter operably 

linked to one or more genes of interest, an origin of second strand DNA synthesis and a 3 5 

lentiviral LTR, wherein the lentiviral vector contains a nuclear transport element. The nuclear 

transport element may be located either upstream (5') or downstream (3 5 ) of a coding 
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sequence of interest (for example, a synthetic Gag or Env expression cassette of the present 
invention). A wide variety of lentiviruses may be utilized within the context of the present 
invention, including for example, lentiviruses selected from the group consisting of HIV, 
HIV-1, HIV-2, FIV, BIV, EIAV, MVV, CAEV, and SIV. Illustrative examples of lentiviral 
5 vectors are described in PCT Publication Nos. WO 00/66759, WO 00/00600, WO 99/24465, 
WO 98/51810, WO 99/51754, WO 99/31251, WO 99/30742, and WO 99/15641. Desirably, a 
third generation SIN lentivirus is used. Commercial suppliers of third generation SIN (self- 
inactivating) lentiviruses include Invitrogen (ViraPower Lentiviral Expression System). 
Detailed methods for construction, transfection, harvesting, and use of lentiviral vectors are 

1 0 given, for example, in the Invitrogen technical manual "ViraPower Lentiviral Expression 
System version B 050102 25-0501", available at http://www.invitrogen.com/Content/Tech- 
Online/molecular_biology/manualsj-ps/virapower_lentiviral_system_man.pdf. Lentiviral 
vectors have emerged as an efficient method for gene transfer. Improvements in biosafety 
characteristics have made these vectors suitable for use at biosafety level 2 (BL2). A number 

15 of safety features are incorporated into third generation SIN (self-inactivating) vectors. 

Deletion of the viral 3' LTR U3 region results in a pro virus that is unable to transcribe a full 
length viral RNA. In addition, a number of essential genes are provided in trans, yielding a 
viral stock that is capable of but a single round of infection and integration. Lentiviral vectors 
have several advantages, including: 1) pseudotyping of the vector using amphotropic 

20 envelope proteins allows them to infect virtually any cell type; 2) gene delivery to quiescent, 
post mitotic, differentiated cells, including neurons, has been demonstrated; 3) their low 
cellular toxicity is unique among transgene delivery systems; 4) viral integration into the 
genome permits long term transgene expression; 5) their packaging capacity (6-14 kb) is 
much larger than other retroviral, or adeno-associated viral vectors. In a recent demonstration 

25 of the capabilities of this system, lentiviral vectors expressing GFP were used to infect murine 
stem cells resulting in live progeny, germline transmission, and promoter-, and tissue-specific 
expression of the reporter (Ailles, L. E. andNaldini, L. a HIV-1 -Derived Lentiviral Vectors. 
In: Trono,D. (Ed.), Lentiviral Vectors, Springer-Verlag, Berlin, Heidelberg, New York, 2002, 
pp. 3 1-52). An example of the current generation vectors is outlined in FIG. 2 of a review by 

30 Lois et al (2002, Science, 295 868-872). 

[0143] The chimeric construct can also be delivered without a vector. For example, 

the chimeric construct can be packaged as DNA or RNA in liposomes prior to delivery to the 

subject or to cells derived therefrom. Lipid encapsulation is generally accomplished using 

liposomes which are able to stably bind or entrap and retain nucleic acid. The ratio of 

- 64 - 



WO 2009/049350 



PCT/AU2008/001463 



condensed DNA to lipid preparation can vary but will generally be around 1:1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight, (1991, Biochim. Biophys. Acta. 1097:1-17); 
and Straubinger et aL 9 in Methods of Enzymology (1983), Vol. 101, pp. 512-527. 

5 [0144] Liposomal preparations for use in the present invention include cationic 

(positively charged), anionic (negatively charged) and neutral preparations, with cationic 
liposomes particularly preferred. Cationic liposomes have been shown to mediate intracellular 
delivery of plasmid DNA (Feigner et al, 1987, Proc. Natl. Acad. Sci. USA 84:7413-7416); 
mRNA (Malone et aU 1989, Proc. Natl Acad. Sci. USA 86:6077-6081); and purified 
10 transcription factors (Debs et al, 1990, J. Biol. Chem. 265:10189-10192), in functional form. 

[0145] Cationic liposomes are readily available. For example, N[l-2,3- 
dioleyloxy)propyl]-N,N,N-triethylammomtim (DOTMA) liposomes are available under the 
trademark Lipofectin, from GIBCO BRL, Grand Island, N.Y. (See, also, Feigner et al 9 1987, 
Proc. Natl. Acad. Sci. USA 84:7413-7416). Other commercially available lipids include 
15 (DDAB/DOPE) and DOTAP/DOPE (Boerhinger). Alternative cationic liposomes can be 
prepared from readily available materials using techniques well known in the art. See, e.g., 
Szoka et al, 1978, Proc. Natl. Acad. Sci. USA 75:4194-4198; PCT Publication No. WO 
90/11092 for a description of the synthesis of DOTAP (l,2-bis(oleoyloxy)-3- 
(trimethylammonio)propane) liposomes. 

20 [0146] Similarly, anionic and neutral liposomes are readily available, such as, from 

Avanti Polar Lipids (Birmingham, Ala.), or can be easily prepared using readily available 
materials. Such materials include phosphatidyl choline, cholesterol, phosphatidyl 
ethanolamine, dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can also be mixed 

25 with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

[0147] The liposomes can comprise multilammelar vesicles (MLVs), small 

unilamellar vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome- 

nucleic acid complexes are prepared using methods known in the art. See, e.g., Straubinger et 

30 a/., in METHODS OF IMMUNOLOGY (1983), Vol. 101, pp. 512-527; Szoka et al, 1978, 

Proc. Natl. Acad. Sci. USA 75:4194-4198; Papahadjopoulos et aL 9 1975, Biochim. Biophys. 

Acta 394:483; Wilson et al, 1979, Cell 17:77); Deamer and Bangham, 1976, Biochim. 

Biophys. Acta 443:629; Ostro et at, 1977, Biochem. Biophys. Res. Commun. 76:836; Fraley 
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etal, 1979, Proc. Natl. Acad. Sci. USA 76:3348); Enoch and Strittmatter, 1979, Proc. Natl. 
Acad. Sci. USA 76:145); Fraley et al, 1980, J. Biol. Chem. 255:10431; Szoka and 
Papahadjopoulos, 1978, Proc. Natl. Acad. Sci. USA 75:145; and Schaefer-Ridder et al 9 1982, 
Science 215:166. 

5 [0148] The chimeric construct can also be delivered in cochleate lipid compositions 

similar to those described by Papahadjopoulos et al, 1975, Biochem. Biophys. Acta. 394:483- 
491. See, also, U.S. Pat. Nos. 4,663,161 and 4,871,488. 

[0149] The chimeric construct may also be encapsulated, adsorbed to, or associated 
with, particulate carriers. Such carriers present multiple copies-of a selected chimeric 

10 construct to the immune system. The particles can be taken up by professional antigen 
presenting cells such as macrophages and dendritic cells, and/or can enhance antigen 
presentation through other mechanisms such as stimulation of cytokine release. Examples of 
particulate carriers include those derived from polymethyl methacrylate polymers, as well as 
microparticles derived from poly(lactides) and poly(lactide-co-glycolides), known as PLG. 

15 See, e.g., Jeffery et al, 1993, Pharm. Res. 10:362-368; McGee J. P., et al, 1997, J 
Microencapsul. 14(2): 197-210; O'HaganD. T., et al, 1993, Vaccine ll(2):149-54. 

[0150] Furthermore, other particulate systems and polymers can be used for the in 
vivo delivery of the chimeric construct. For example, polymers such as polylysine, 
polyarginine, polyomithine, spermine, spermidine, as well as conjugates of these molecules, 

20 are useful for transferring a nucleic acid of interest. Similarly, DEAE dextran-mediated 

transfection, calcium phosphate precipitation or precipitation using other insoluble inorganic 
salts, such as strontium phosphate, aluminum silicates including bentonite and kaolin, 
chromic oxide, magnesium silicate, talc, and the like, will find use with the present methods. 
See, e.g., Feigner, P. L., Advanced Drug Delivery Reviews (1990) 5:163-187, for a review of 

25 delivery systems useful for gene transfer. Peptoids (Zuckerman, R. N., et aL, U.S. Pat. No. 
5,83 1,005, issued Nov. 3, 1998) may also be used for delivery of a construct of the present 
invention. 

[0151] Additionally, biolistic delivery systems employing particulate carriers such 

as gold and tungsten, are especially useful for delivering chimeric constructs of the present 

30 invention. The particles are coated with the synthetic expression cassette(s) to be delivered 

and accelerated to high velocity, generally under a reduced atmosphere, using a gun powder 

discharge from a "gene gun." For a description of such techniques, and apparatuses useful 

therefor, see, e.g., U.S. Pat. Nos. 4,945,050; 5,036,006; 5,100,792; 5,179,022; 5,371,015; and 
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5,478,744. In illustrative examples, gas-driven particle acceleration can be achieved with 
devices such as those manufactured by PowderMed Pharmaceuticals PLC (Oxford, UK) and 
PowderMed Vaccines Inc. (Madison, Wis.), some examples of which are described in U.S. 
Pat. Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; andEP Patent No. 0500 799. This 
5 approach offers a needle-free delivery approach wherein a dry powder formulation of 

microscopic particles, such as polynucleotide or polypeptide particles, are accelerated to high 
speed within a helium gas jet generated by a hand held device, propelling the particles into a 
target tissue of interest. Other devices and methods that may be useful for gas-driven needle- 
less injection of compositions of the present invention include those provided by Bioject, Inc. 
10 (Portland, Oreg.), some examples of which are described in U.S. Pat. Nos. 4,790,824; 
5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 and 5,993,412. 

[0152] Alternatively, micro-cannula- and microneedle-based devices (such as those 
being developed by Becton Dickinson and others) can be used to administer the chimeric 
constructs of the invention. Illustrative devices of this type are described in EP 1 092 444 Al, 

15 and U.S. application Ser. No. 606,909, filed Jun. 29, 2000. Standard steel cannula can also be 
used for intra-dermal delivery using devices and methods as described in U.S. Ser. No. 
417,671, filed Oct. 14, 1999. These methods and devices include the delivery of substances 
through narrow gauge (about 30 G) "micro-cannula" with limited depth of penetration, as 
defined by the total length of the cannula or the total length of the cannula that is exposed 

20 beyond a depth-limiting feature. It is within the scope of the present invention that targeted 
delivery of substances including chimeric constructs can be achieved either through a single 
microcannula or an array of microcannula (or "microneedles"), for example 3-6 microneedles 
mounted on an injection device that may include or be attached to a reservoir in which the 
substance to be administered is contained. 

25 7. Compositions 

[0153] The invention also provides compositions, particularly immunomodulating 
compositions, comprising one or more of the chimeric constructs described herein. The 
immunomodulating compositions may comprise a mixture of chimeric constructs, which in 
turn may be delivered, for example, using the same or different vectors or vehicles. Antigens 

30 may be administered individually or in combination, in e.g. , prophylactic (i. e. , to prevent 
infection or disease) or therapeutic (to treat infection or disease) immunomodulating 
compositions. The immunomodulating compositions may be given more than once (e.g., a 
"prime" administration followed by one or more "boosts") to achieve the desired effects. The 
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same composition can be administered in one or more priming and one or more boosting 
steps. Alternatively, different compositions can be used for priming and boosting. 

[0154] The immunomodulating compositions will generally include one or more 
"pharmaceutically acceptable excipients or vehicles" such as water, saline, glycerol, ethanol, 
5 etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering 
substances, and the like, may be present in such vehicles. 

[0155] Immunomodulating compositions will typically, in addition to the 
components mentioned above, comprise one or more "pharmaceutically acceptable carriers." 
These include any carrier which does not itself induce the production of antibodies harmful to 

10 the individual receiving the composition. Suitable carriers typically are large, slowly 

metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, and lipid aggregates (such as oil 
droplets or liposomes). Such carriers are well known to those of ordinary skill in the art. A 
composition may also contain a diluent, such as water, saline, glycerol, etc. Additionally, an 

1 5 auxiliary substance, such as a wetting or emulsifying agent, pH buffering substance, and the 
like, may be present. A thorough discussion of pharmaceutically acceptable components is 
available in Gennaro (2000) Remington: The Science and Practice of Pharmacy. 20th ed., 
ISBN: 0683306472. 

[0156] Pharmaceutically compatible salts can also be used in compositions of the 
20 invention, for example, mineral salts such as hydrochlorides, hydrobromides, phosphates, or 
sulfates, as well as salts of organic acids such as acetates, proprionate, malonates, or 
benzoates. Especially useful protein substrates are serum albumins, keyhole limpet 
hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and other 
proteins well known to those of skill in the art. 

25 [0157] The chimeric constructs of the invention can also be adsorbed to, entrapped 

within or otherwise associated with liposomes and particulate carriers such as PLG. 

[0158] The chimeric constructs of the present invention are formulated into 
compositions for delivery to a mammal. These compositions may either be prophylactic (to 
30 prevent infection) or therapeutic (to treat disease after infection). The compositions will 

comprise a "therapeutically effective amount" of the gene of interest such that an amount of 
the antigen can be produced in vivo so that an immune response is generated in the individual 
to which it is administered. The exact amount necessary will vary depending on the subject 
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being treated; the age and general condition of the subject to be treated; the capacity of the 
subject's immune system to synthesize antibodies; the degree of protection desired; the 
severity of the condition being treated; the particular antigen selected and its mode of 
administration, among other factors. An appropriate effective amount can be readily 
5 determined by one of skill in the art. Thus, a "therapeutically effective amount" will fall in a 
relatively broad range that can be determined through routine trials. 

[0159] Once formulated, the compositions of the invention can be administered 
directly to the subject (e.g., as described above). Direct delivery of chimeric construct- 
containing compositions in vivo will generally be accomplished with or without vectors, as 

10 described above, by injection using either a conventional syringe, needless devices such as 

Bioject™ or a gene gun, such as the Accell™ gene delivery system (PowderMed Ltd, Oxford, 
England) or microneedle device. The constructs can be delivered (e.g., injected) either 
subcutaneously, epidermally, intradermally, intramuscularly, intravenous, intramucosally 
(such as nasally, rectally and vaginally), intraperitoneally or orally. Delivery of nucleic acid 

1 5 into cells of the epidermis is particularly preferred as this mode of administration provides 

access to skin-associated lymphoid cells and provides for a transient presence of nucleic acid 
(e.g., DNA) in the recipient. Other modes of administration include oral ingestion and 
pulmonary administration, suppositories, needle-less injection, transcutaneous, topical, and 
transdermal applications. Dosage treatment may be a single dose schedule or a multiple dose 

20 schedule. 

[0160] In order that the invention may be readily understood and put into practical 
effect, particular preferred embodiments will now be described by way of the following non- 
limiting examples. 
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EXAMPLES 

EXAMPLE 1 

Synthetic Construct System for Determining the Immune Response Preference 

of Codons in Mammals 

5 Materials and Methods 

Primer design/ synthesis and sequence manipulation 
[0161] Oligonucleotides for site-directed mutagenesis were designed according to 
the guidelines included in the mutagenesis kit manuals (Quikchange II Site-directed 
Mutagenesis kit or Quikchange Multi Site-directed Mutagenesis Kit; Stratagene, La Jolla 
10 CA). These primers were synthesized and PAGE purified by Sigma (formerly Proligo). 

[0162] Oligonucleotides for whole gene synthesis were designed by eye and 
synthesized by Sigma (formerly Proligo). The primers were supplied as standard desalted 
oligos. No additional purification of the oligonucleotides was carried out. 

[0163] Sequence manipulation and analysis was carried out using the suite of 
1 5 programs on Biomanager (ANGIS) and various other web-based programs including BLAST 
at NCBI r httD://www.ncbi.nlm.nih.qov/blast/bl2sea/wblast2.cai\ NEBcutter V2.0 from New 
England Biolabs ( httD://tools.neb.com/NEBcutter2/index.php\ the Translate Tool on ExPASy 
( http://au.expasv.orq/tools/dna.html \ and the SignalP 3.0 server 
(http://w\AAA^.cbs.dtu.dk/sen/ices/SiqnalP/\ 

20 Standard cloning techniques 

[0164] Restriction enzyme digests, alkaline phosphatase treatments and ligations 
were carried out according to the enzyme manufacturers' instructions (various manufacturers 
including New England Biolabs, Roche and Fermentas). 

[0165] Purification of DNA from agarose gels and preparation of mini-prep DNA 
25 were carried out using commercial kits (Qiagen, Bio-Rad, Macherey-Nagel). 

[0166] Agarose gel electrophoresis, phenol/chloroform extraction of contaminant 
protein from DNA, ethanol precipitation of DNA and other basic molecular biological 
procedures were carried out using standard protocols, similar to those described in Current 
Protocols in Molecular Biology (Ebook available via Wiley InterScience; edited by Ausubel 
30 etal). 
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[0167] Sequencing was carried out by the Australian Genome Research Facility 
(AGRF, Brisbane). 

Whole gene synthesis 
[0168] Overlapping ~35-50mer oligonucleotides (Sigma-Proligo) were used to 
5 synthesize longer DNA sequences. Restriction enzyme sites were incorporated to facilitate 
cloning. The method used to synthesize the fragments is based on that given in Smith et al. 
(2003). First, oligonucleotides for the top or bottom strand were mixed and then 
phosphorylated using T4 polynucleotide kinase (PNK; New England Biolabs). The 
oligonucleotide mixes were then purified from the PNK by a standard phenol/ chloroform 
10 extraction and sodium acetate/ ethanol (NaAc/EtOH) precipitation. Equal volumes of 
oligonucleotide mixes for the top and bottom strands were then mixed and the 
oligonucleotides denatured by heating at 95°C for 2 mins. The oligonucleotides were annealed 
by slowly cooling the sample to 55°C and the annealed oligonucleotides ligated using Taq 
ligase (New England Biolabs). The resulting fragment was purified by phenol/CHCl 3 
1 5 extraction and NaAc/EtOH precipitation. 

[0169] The ends of the fragments were filled in and the fragments then amplified, 
using the outermost forward and reverse primers, with the Clontech Advantage HF 2 PGR kit 
(Clontech) according to the manufacturer's instructions. To fill in the ends the following PGR 
was used: 35 cycles of a denaturation step of 94°C for 15s, a slow annealing step where the 
20 temperature was ramped down to 55°C over 7 minutes and then kept at 55°C for 2 min, and 
an elongation step of 72°C for 6 minutes. A final elongation step for 7min at 72°C was then 
carried out. The second PGR to amplify the fragment involved: an initial denaturation step at 
94°C for 30s, followed by 25 cycles of 94°C for 15s, 55°C 30s and 68°C for 1 min, and a final 
elongation step of 68°C for 3 mins. 

25 [0170] The fragments were then purified by gel electrophoresis, digested and 

ligated into the relevant vector. Following transformation of E. coli with the ligation mixture, 
mini-preps were made for multiple colonies and the inserts sequenced. Sometimes it was not 
possible to isolate clones with entirely correct sequence. In those cases the errors were fixed 
by single or multi site-directed mutagenesis. 

30 Site-directed mutagenesis 

[0171] Mutagenesis was carried out using the Quikchange II Site-directed 

Mutagenesis kit or Quikchange Multi Site-directed Mutagenesis Kit (Stratagene, La Jolla 
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CA), with appropriate PAGE (polyacrylamide gel electrophoresis)-purified primers (Sigma), 
according to the manufacturer's instructions. 

r 

Preparation of constructs 
[0172] The details of the constructs used to generate the codon preference table are 
summarized in TABLE 12. All constructs were made using pCDNA3 from Invitrogen and 
were verified by sequencing prior to use. 



TABLE 12 

Summary of Secretory E7 Construct Series 1 and 2 



Construct 


AA & Codon 


CU of Sec Seq 


CU ofE7 


E7 Protein 


Control Constructs 


IgkCl 


N/A 


wt 


wt 


non-one 


IgkC2 


N/A 


mc 


mc 


non-one 


IgkC3 


N/A 


wt 


wt 


one 


IgkC4 


N/A 


mc 


mc 


one 


Secretory E7 construct series 1 


IgkSl-1 


Ala GCG 


wt 


wt with all Ala 

gcg 


non-one 


IgkSl-2 


Ala GCA 


wt 


wt with all Ala 
gca 


non-one 


IgkSl-3 


Ala GCT 


wt 


wt with all Ala 
get 


non-one 


IgkSl-4 


Ala GCC 


wt 


wt with all Ala 
gec 


non-one 


IgkSl-5 


Arg AGG 


wt 


wt with all Arg 
agg 


non-one 


IgkSl-6 


Arg AGA 


wt 


wt with all Arg 
aga 


non-one 


IgkSl-7 


Arg CGG 


wt 


wt with all Arg 
egg 


non-one 


IgkSl-8 


Arg CGA 


wt 


wt with all Arg 
cga 


non-one 


IgkSl-9 


Arg CGT 


wt 


wt with all Arg 


non-one 
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v^onsLruci 




tu oi oec oeq 


\^\J OI l 


E7 Protein 








cgt 




IgkSl-10 


ArgCGC 


wt 


wt with all Arg 
cgc 


non-one 


IgkSl-11 


Asn AAT 


wt 


wt with all Asn 
aat 


non-one 


IgkSl-12 


Asn AAC 


wt 


wt with all Asn 
aac 


non-one 


IgkSl-13 


Asp GAT 


wt with all Asp 
gat 


wt with all Asp 
gat 


non-one 


IgkSl-14 


Asp GAC 


wt with all Asp 
gac 


wt with all Asp 
gac 


non-one 


IgkSl-15 


Cys TGT 


wt 


wt with all Cys 
tgt 


non-one 


IgkSl-16 


Cys TGC 


wt 


wt with all Cys 
tgc 


non-one 


IgkSl-17 


Glu GAG 


wt with all Glu 
gag 


wt with all Glu 
gag 


non-one 


IgkSl-18 


GluGAA 


wt with all Glu 
gaa 


wt with all Glu 
gaa 


non-one 


IgkSl-19 


Gin CAG 


wt 


wt with all Gin 
cag 


non-one 


IgkSl-20 


Gin CAA 


wt 


wt with all Gin 
caa 


non-one 


IgkSl-21 


Gly GGG 


wt with all Gly 
ggg 


wt with all Gly 
ggg 


non-one 


IgkSl-22 


Gly GGA 


wt with all Gly 
gga 


wt with all Gly 
gga 


non-one 


IgkSl-23 


Gly GGT 


wt with all Gly 
ggt 


wt with all Gly 
ggt 


non-one 


IgkSl-24 


Gly GGC 


wt with all Gly 
ggc 


wt with all Gly 
ggc 


non-one 


IgkSl-25 


His CAT 


wt 


wt with all His 
cat 


non-one 


IgkSl-26 


His CAC 


wt 


wt with all His 
cac 


non-one 
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Construct 


AA & Codon 


CUofSecSeq 


CU ofE7 


E7 Protein 


IgkSl-27 


He ATA 


wt 


wt with all He 
ata 


non-one 


IgkSl-28 


He ATT 


wt 


wt with all He 
att 


non-one | 


IgkSl-29 


He ATC 


wt 


wt with all He 
ate 


non-one 


IgkSl-30 


Lys AAG 


wt 


wt with all Lys 
aag 


non-one 


IgkSl-31 


Lys AAA 


wt 


wt with all Lys 
acta 


non-one 


IgkSl-32 


PheTTT 


wt 


wt with all Phe 
ttt 


non-one L15F, 
L22F 


IgkSl-33 


Phe TTC 


wt 


wt with all Phe 
ttc 


non-one L15F, 
L22F 


IgkSl-34 


Ser AGT 


wt with all Ser 
agt 


wt with all Ser 
agt 


non-one 


IgkSl-35 


Ser AGC 


wt with all Ser 
age 


wt with all Ser 
age 


non-one 


IgkSl-36 


Ser TCG 


wt with all Ser 
teg 


wt with all Ser 
teg 


non-one 


IgkSl-37 


Ser TCA 


wt with all Ser 
tea 


wt with all Ser 
tea 


non-one 


IgkSl-38 


Ser TCT 


wt with all Ser 
tct 


wt with all Ser 
tct 


non-one 


IgkSl-39 


Ser TCC 


wt 


wt with all Ser 
tec 


non-one 


IgkSl-40 


Thr ACG 


wt with all Thr 
acg 


wt with all Thr 
acg 


non-one 


IgkSl-41 


Thr ACA 


wt with all Thr 
aca 


wt with all Thr 
aca 


non-one 


IgkSl-42 


Thr ACT 


wt with all Thr 
act 


wt with all Thr 
act 


non-one 


IgkSl-43 


Thr ACC 


wt with all Thr 
acc 


wt with all 1 nr 

acc 


non-one 


IgkSl-44 


Tyr TAT 


wt 


wt with all Tyr 
tat 


non-one 
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Construct 


AA & Codon 


CU of Sec Seq 


CU ofE7 


E7 Protein 


IgkSl-45 


Tyr TAC 


wt 


wt with all Tyr 
tac 


non-one 


IgkSl-46 


Val GTG 


wt with all Val 
gtg 


wt with all Val 
gtg 


non-one 


IgkSl-47 


Val GTA 


wt with all Val 
gta 


wt with all Val 
gta 


non-one 


IgkSl-48 


Val GTT 


wt with all Val 
gtt 


wt with all Val 
gtt 


non-one 


IgkSl-49 


Val GTC 


wt with all Val 
gtc 


wt with all Val 
gtc 


non-one 


IgkSl-50 


Leu CTG 


altered with Leu 
ctg 


altered with Leu 
ctg 


one 


IgkSl-51 


Leu CTA 


altered with Leu 
eta 


altered with Leu 
eta 


one 


IgkSl-52 


Leu CTT 


altered with Leu 
ctt 


altered with Leu 
ctt 


one 


IgkSl-53 


Leu CTC 


altered with Leu 
etc 


altered with Leu 
etc 


one 


IgkSl-54 


Leu TTG 


altered with Leu 
ttg 


altered with Leu 
ttg 


one 


IgkSl-55 


Leu TTA 


altered with Leu 
tta 


altered with Leu 
tta 


one 


IgkSl-56 


Pro CCG 


altered with Pro 
ccg 


altered with Pro 
ccg 


one 


IgkSl-57 


Pro CCA 


altered with Pro 
cca 


altered with Pro 
cca 


one 


IgkSl-58 


Pro CCT 


altered with Pro 
cct 


altered with Pro 
cct 


one 


IgkSl-59 


Pro CCC 


altered with Pro 
ccc 


altered with Pro 
ccc 


one 


Secretory E7 construct series 2 


IgkS2-l 


Ala GCG 


mc 


mc 


linkerA-onc 


IgkS2-2 


Ala GCA 


mc 


mc 


linkerA-onc 


IgkS2-3 


Ala GCT 


mc 


mc 


linkerA-onc 
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Construct 


AA & Codon 


CU of Sec Seq 


CU ofE7 


E7 Protein 


IgkS2-4 


Ala GCC 


mc 


mc 


linkerA-onc 


IgkS2-5 


Arg AGG 


mc 


mc 


linkerR-onc 


IgkS2-6 


Arg AGA 


mc 


mc 


linkerR-onc 


IgkS2-7 


ArgCGG 


mc 


mc 


linkerR-onc 


IgkS2-8 


Arg CGA 


mc 


mc 


linkerR-onc 


IgkS2-9 


Arg CGT 


mc 


mc 


linkerR-onc 


IgkS2-10 


ArgCGC 


mc 


mc 


linkerR-onc 


IgkS2-ll 


Asn AAT 


mc 


mc 


linkerN-onc 


IgkS2-12 


Asn AAC 


mc 


mc 


linkerN-onc 


IgkS2-13 


Asp GAT 


wt with all Asp 
gat 


wt with all Asp 
gat 


one 


IgkS2-14 


Asp GAC 


wt with all Asp 
gac 


wt with all Asp 
gac 


one 


IgkS2-15 


Cys TGT 


wt 


wt with all Cys 
tgt 


one 


IgkS2-16 


Cys TGC 


wt 


wt with all Cys 
tgc 


one ! 


IgkS2-17 


Glu GAG 


wt with all Glu 
gag 


wt with all Glu 
gag 


one 


IgkS2-18 


GluGAA 


wt with all Glu 
gaa 


wt with all Glu 
gaa 


one 


IgkS2-19 


Gin CAG 


wt 


wt with all Gin 
cag 


one 


IgkS2-20 


Gin CAA 


wt 


wt with all Gin 
caa 


one 


IgkS2-21 


Gly GGG 


wt with all Gly 
ggg 


wt with all Gly 
ggg 


one 


IgkS2-22 


Gly GGA 


wt with all Gly 
gga 


wt with all Gly 
gga 


one 


IgkS2-23 


Gly GGT 


wt with all Gly 
ggt 


wt with all Gly 
ggt 


one 


IgkS2-24 


Gly GGC 


wt with all Gly 
ggc 


wt with all Gly 
ggc 


one 
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Construct 


AA & Codon 


CUofSec Seq 


CU ofE7 


E7 Protein 


IgkS2-25 


His CAT 


mc 


mc 


linkerH-onc 


IgkS2-26 


His CAC 


mc 


mc 


linkerH-onc 


IgkS2-27 


He ATA 


wt 


wt with all He 
ata 


one 


IgkS2-28 


He ATT 


wt 


wt with all He 
att 


one 


IgkS2-29 


He ATC 


wt 


wt with all He 
ate 


one 


IgkS2-30 


Lys AAG 


mc 


mc 


linkerK-onc 


IgkS2-31 


Lys AAA 


mc 


mc 


linkerK-onc 


IgkS2-32 


Phe TTT 


mc 


mc 


linkerF-onc 


IgkS2-33 


Phe TTC 


mc 


mc 


linkerF-onc 


IgkS2-34 


Ser AGT 


wt with all Ser 
agt 


wt with all Ser 
agt 


one 


IgkS2-35 


Ser AGC 


wt with all Ser 
age 


wt with all Ser 
age 


one 


IgkS2-36 


Ser TCG 


wt with all Ser 
teg 


wt with all Ser 
teg 


one 


IgkS2-37 


Ser TCA 


wt with all Ser 
tea 


wt with all Ser 
tea 


one 


IgkS2-38 


Ser TCT 


wt with all Ser 
tct 


wt with all Ser 
tct 


one 


IgkS2-39 


Ser TCC 


wt 


wt with all Ser 
tec 


one 


IgkS2-40 


Thr ACG 


wt with all Thr 
acg 


wt with all Thr 
acg 


one 


IgkS2-41 


Thr ACA 


wt with all Thr 
aca 


wt with all Thr 
aca 


one 


IgkS2-42 


Thr ACT 


wt with all Thr 
act 


wt with all Thr 
act 


one 


IgkS2-43 


Thr ACC 


wt with all Thr 
acc 


wt with all Thr 
acc 


one 


IgkS2-44 


Tyr TAT 


mc 


mc 


linkerY-onc 


IgkS2-45 


Tyr TAC 


mc 


mc 


linkerY-onc 
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Construct 


AA & Codon 


CU of Sec Seq 


CU of E7 


E7 Protein 


IgkS2-46 


Val GTG 


wt with all Val 
gtg 


wt with all Val 
gtg 


one 


IgkS2-47 


Val GTA 


wt with all Val 
gta 


wt with all Val 
gta 


one 


IgkS2-48 


Val GTT 


wt with all Val 
gtt 


wt with all Val 
gtt 


one 


IgkS2-49 


Val GTC 


wt with all Val 
gtc 


wt with all Val 
gtc 


one 


IgkS2- 
lib 


Asn AAT 


wt 


wt with all Asn 
aat 


linkerN-non-onc 


IgkS2- 
12b 


Asn AAC 


wt 


wt with all Asn 
aac 


linkerN-non-onc 



AA= amino acid, CU= codon usage, mc= mammalian consensus, wt= wild-type, onc= 
oncogenic, non-onc= non-oncogenic, Sec seq= secretory sequence, N/A= not applicable 

Control constructs 

5 [0173] Control E7 constructs were based on those from Liu et al (2002). Both 

oncogenic {i.e. wild-type) and non-oncogenic E7 control constructs were made with wild-type 
or mammalian consensus codon usage. "Non-oncogenic" E7 is E7 with D21G, C24G, E26G 
mutations, i.e. with mutations that have been reported to render E7 non-transforming 
(Edmonds and Vousden, 1989; Heck et al, 1992). 

10 [0174] The secretory sequence was derived from Mus musculus IgK RNA for the 

anti-HLA-DR antibody light chain (GenBank accession number D84070). For some 
constructs the codon usage of this sequence was modified. 

Wild-type codon usage control constructs: 
[0175] The wild-type (wt) codon usage E7 construct from Liu et al. was used as the 
1 5 template in a site-directed mutagenesis PCR to make the wt codon usage non-oncogenic E7 
construct. 

[0176] The non-oncogenic and oncogenic wild-type codon usage E7 sequences 
were amplified to incorporate a 5 5 BamUl site and a 3' EcoKl site. The resulting fragments 
were cloned into BamUl and EcoRl cut pCDNA3 and sequenced. The secretory fragment was 
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made by whole gene synthesis using wild-type codon usage with flanking Kpnl and BamHI 
sites. The Kozak-secretory fragments were then ligated into Kpnll BamHI cut pCDNA3-wtE7 
(non-oncogenic or oncogenic) to make pCDNA3-Igk-nE7 and pCDNA3-Igk-E7 (named 
IgkCl and IgkC3 respectively; see TABLE 12). The identity of the constructs was confirmed 
5 by sequencing. 

Mammalian consensus (mc) codon usage control constructs: 
[0177] As there were errors in the original mammalian consensus (mc) E7 construct 
(L2SF, Q70R and an E35 deletion; Liu et aL, 2002) it was not used. A mc non-oncogenic E7 
control construct was synthesized by whole gene synthesis. A mc oncogenic E7 {i.e., wild- 
10 type E7) control construct was subsequently made from the mc non-oncogenic E7 construct 
by single site-directed mutagenesis. 

[0178] Secretory mc oncogenic and non-oncogenic constructs were made by 
amplifying the mc E7 sequence with a forward primer that introduced a BamHI site and a 
reverse primer that incorporated an EcoKL site. The resulting E7 fragment was cloned into the 
15 respective sites in pCDNAS and sequenced. A mc secretory sequence flanked by Kpnl and 
BamHI sites, 5 5 and 3' respectively, was synthesised and ligated into the Kpnl and BamHI 
sites of pCDN A3 -mcE7 (oncogenic or non-oncogenic) to make pCDNA3-mdgk-mcnE7 and 
pCDNA3-mdgk-mcE7 (named IgkC2 and IgkC4 respectively; see TABLE 12). The 
identity of the constructs was confirmed by sequencing. 

20 Secreted non-oncogenic E7 constructs with predominantly wild-type codon 

usage, modified for individual codons 
[0179] Plasmids encoding a non-oncogenic form of E7 were made for all of the 
codons, with the exception of the Pro and Leu codons, stop codons and codons for non- 
degenerate amino acids. As Phe occurs just once in the E7 sequence, the codons for two Leu 

25 residues, LI 5 and L22, were mutated to Phe codons. A combination of techniques was used to 
make these constructs. When few mutations were required single or multi site-directed 
mutagenesis of a control construct encoding non-oncogenic E7 was performed (details of the 
control construct are given above under "control constructs"). When more extensive 
modifications were required whole gene synthesis was employed. Regardless of the methods 

30 used these constructs all include an E7 encoding sequence with identical upstream and 

downstream sequence cloned into the Kpnl and EcdBl sites of pCDNA3. These constructs 
were then modified to include a secretory sequence, as described below. 
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[0180] First, using the whole gene synthesis method, DNA fragments that included 
a secretory sequence flanked by Kpnl and BamHI sites were synthesized. For some constructs 
the amino acid of interest occurred in the secretory sequence so individual modified secretory 
sequence fragments were made. For constructs for amino acids that did not occur in the 
5 secretory sequence, wild-type secretory sequence was used. These fragments were digested 
with Kpnl and BamHI. Then, using the relevant nE7 construct as a template and a standard 
PGR protocol, a BamHI site was introduced at the 5 5 end of the E7 sequence. The 3' EcoKL 
site was retained. The resulting E7 fragments were cut with BamHI and EcoKL, purified, and 
ligated into pCDNAS. Following sequencing, the plasmids were cut with Kpnl and BamHI 
10 and ligated with the relevant KpnV BamHI secretory sequences. The sequences of the 

constructs were then confirmed. Constructs IgkSl-1 to IgkSl-49 were made in this way (see 
TABLE 12 and Figures 1 to 11, 13 and 15 to 17 for sequence comparisons). 

Secreted E7 constructs with individual Pro or Leu codons modified 
[0181] E7 DNA sequences in which the Pro or Leu codons were individually 
1 5 modified were designed. The rest of the codon usage for these E7 DNAs was the same for all 
of the Pro and Leu constructs but differed from the wild-type or mammalian consensus codon 
usage. [Note that this codon usage was based on our preliminary data from immunizing mice 
with the GFP constructs.] 

[0182] The Pro/LeuE7 DNA fragments, flanked by Hin&III and BamHI sites, were 
20 made by whole gene synthesis and cloned into the Hindlll and BamHI sites of pCDNA3. 

Using these constructs as templates, a KpnL site was incorporated upstream and an EcoKL site 
downstream, of the Pro/Leu E7 sequences by standard PGR methods. The resulting fragments 
were cut with Kpnl and EcoKL and cloned into pCDNA3. These constructs were then used to 
make the secreted E7 constructs with Pro or Leu codon modifications. 

25 [0183] Firstly, using the whole gene synthesis method, DNA fragments that 

included a secretory sequence flanked by Kpnl and BamHI sites were synthesized. As Pro and 
Leu occur in the secretory sequence, individually modified secretory sequence fragments 
were made for the different constructs. These fragments were digested with Kpnl and BamHI. 
Then, using the relevant Pro or Leu E7 construct as a template and a standard PGR protocol, a 

30 BamHI site was introduced at the 5' end of the E7 sequence. The 3> EcoKL site was retained. 
The resulting fragments were cut with BamHI and EcoKL, purified, and ligated into pCDNA3. 
Following sequencing, the plasmids were cut with Kpnl and BamHI and ligated with the 



-80- 



WO 2009/049350 PCT/AU2008/001463 



relevant KpriU BamBI secretory sequences. The resulting constructs were sequenced and are 
denoted IgkSl-50 to IgkSl-59 (see TABLE 12 and Figures 12 and 14 for sequence 
comparisons). 

Secreted E7 constructs with predominantly wild-tvpe codon usage, modified 
for individual codons 
[0184] Constructs encoding a secreted form of oncogenic E7 (i.e. wild-type E7 
protein) were made by site-directed mutagenesis of the plasmids encoding a secreted form of 
non-oncogenic E7. This was done for constructs for codons for the following amino acids: 
Asp, Cys, Glu, Gin, Gly, lie, Ser, Thr and Val. 

[0185] Site-directed mutagenesis was carried out using the Quikchange II Site- 
directed Mutagenesis kit (Stratagene, La Jolla CA) and appropriate PAGE (polyacrylamide 
gel electrophoresis)-purified primers (Sigma) according to the manufacturer's instructions. 
The pCDNA-kIgkX-nE7X series of constructs were used as templates for the mutagenesis 
(i.e. constructs IgkSl-13 to 24, IgkSl-27 to 29, IgkSl-34 to 43 and IgkSl-46 to 49). The 
primers introduced the desired G21D, G24C, G26E mutations. 

[0186] The resulting constructs, IgkS2-13 to 24, IgkS2-27 to 29, IgkS2-34 to 43 
and IgkS2-46 to 49 (see Table 8, SEQ ID NOs: 1 to 29), have wild-type codon usage for the 
Igk secretory sequence and E7 sequence with the exception that the codons for the relevant 
amino acid were changed, and they encode oncogenic E7. 

20 Linker constructs 

[0187] Constructs encoding the N-terminal Igk secretory sequence followed by a 
linker sequence (XXGXGXX, where X is the relevant amino acid for a particular construct 
and G is glycine) and the E7 protein were made for each of the following amino acids: Asn, 
Ala, Lys, Arg, Phe, His and Tyr. 

25 [0188] Fragments consisting of the Igk secretory sequence (with mammalian 

consensus codon usage) and the linker sequences were made by PGR using Taq polymerase 
and standard cycling conditions, as recommended by the manufacturer. 

[0189] The fragments were amplified from pCDNA3-kmcIgk-mcE7 using a 
common forward primer 

30 (5 'TTGAATAGGTACCGCCGCCACCATGGAGACCGACACCCTCC3 5 ; SEQ IDNO: 90) 
that annealed to the Kpnl site, the Kozak sequence and the beginning of the Igk secretory 
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sequence. The reverse primers were different for each linker construct and annealed to the end 
of the Igk secretory sequence (with mammalian consensus codon usage), introduced new 
sequence that encoded the relevant linker sequence and a 3' BamHl site. 

[0190] The fragments were digested with KpnVBamHl and were ligated into 
5 Kpnl/BamHI-out pCDN A3 -mdgk~mcE7 (i.e. the Kozak sequence and secretory sequence had 
been removed from the plasmid by digestion) to make pCDN A3 -mcIgk-linkerX-mcE7 (i.e., 
IgkS2-l to 12, IgkS2~25 and 26, IgkS2-30 to 33 and IgkS2-44 and 45 as illustrated in Table 
8, SEQ ID NOs: 30 to 49). 

[0191] For Asn the fragments were also ligated into Kpnl/BamHI-out pCDNA3- 
10 Igk-hE7Asnl/2 (i.e. IgkSl-11 and 12) to make pCDNA3-mcIgk-linkerNl/2-nE7Asnl/2 
IgkS2-llb and IgkS2-12b, see Table 12). 

E7 Protein Expression 

Cell culture 

[0192] CHO cells were cultured in DMEM (GIBCO from Invitrogen) containing 
15 10% foetal bovine serum (FBS) (DKSH), penicillin, streptomycin and glutamine (GIBCO 
from Invitrogen) at 37° C and 5% C0 2 . Cells were plated into 6-well plates at 3 x 10 5 /well, 
24 hours prior to transfection. For each transfection, 2 jag of DNA was mixed with 50 jaL 
OptiMEM (GIBCO from Invitrogen) and 4 jllL Plus reagent (Invitrogen) and incubated at 
room temperature (RT) for 30 min. Lipofectamine (Invitrogen; 5 p,L in 50 \xL OptiMEM) was 
20 added and the complexes incubated at RT for 30 min. The cells were rinsed with OptiMEM, 2 
mL OptiMEM were added to each well, and the complexes then added. The cells were 
incubated overnight at 37°C and 5% C0 2 . The following morning the complexes were 
removed and 2ml of fresh DMEM containing 2% FBS added to each well. 

[0193] Cell pellets and supernatants were collected about 40 h after transfection. 
25 The cell pellets were resuspended in lysis buffer (0.1% NP-40, 2 (ag/mL Aprotinin, 1 jxg/mL 
Leupeptin and 2mM PMSF in PBS). Transfections were carried out in duplicate and repeated. 
Control transfections, with empty vector (pCDNA3), were also carried out. 

Western blotting 

[0194] Western blots of the CHO cell supernatants or lysates were carried out 
30 according to standard protocols. Briefly, this involved firstly separating the samples by 
polyacrylamide gel electrophoresis (PAGE). For cell lysates, 30 jag of total protein were 
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loaded for each sample. For supernatants, 30 jjL of each was loaded. The protein samples 
were boiled with SDS-PAGE loading buffer for 10 mins before loading onto 12% SDS-PAGE 
gels and the gels were run at 150-200V for approximately lh. 

[0195] The separated proteins were then transferred from the gels to PVDF 
5 membrane (100V for lh). The membranes were blocked with 5% skim milk (in PBS/0.05% 
Tween 20 (PBS-T)) for lh at room temperature and were then incubated with the primary 
antibody, HPV-16 E7 Mouse Monoclonal Antibody (Zymed Laboratories) at a concentration 
of 1 : 1000 in 5% skim milk (in PBS-T) overnight at 4°C. Following washing of the membrane 
in PBS-T (3 X lOmin), secondary antibody, anti-mouse IgG (Sigma) in 5% skim milk, was 

10 added and the membrane incubated at room temperature for 4h. The membranes were washed 
as before, incubated in a mixture containing equal volumes of solution A (4.425 mL water, 50 
jliL luminol, 22 jliL p-coumaric and 500 joL 1M Tris pH 8.5) and solution B (4.5 mL water, 3 
|liL 30% H2O2 and 500 jaL 1M Tris pH8.5) for 1 min, and then dried and wrapped in plastic 
wrap. Film was exposed to the blots for various times (lmin, 3 min or 10 min) and the film 

15 then developed. 

Gene Gun Immunization Protocols 
Plasmid Purification 

[0196] All plasmids used for vaccination were grown in the Escherichia coli strain 
DH5a and purified using the Nucleobond Maxi Kit (Machery-Nagal). DNA concentration 
20 was quantitated spectrophotometrically at 260 nm. 

Preparation of DNA/gold cartridges 

[0197] Coating of gold particles with plasmid DNA was performed as described in 
the Biorad Helios Gene Gun System instruction manual using a microcarrier loading quantity 
(MLQ) of 0.5mg gold/cartridge and a DNA loading ratio of 2 fig DNA/mggold. This resulted 

25 in 1 |LXg of DNA per prepared cartridge. In brief 50 \iL of 0.05M spermidine (Sigma) was 

added to 25 mg of 1 .0 jum gold particles (Bio-Rad) and the spermidine/gold was sonicated for 
3 seconds.. 50 |Lig of plasmid DNA was then added, followed by the drop wise addition of 100 
|j,L 1M CaCl2 while vortexing. The mixture was allowed to precipitate at room temperature 
for 10 min, then centrifuged to pellet the DNA/ gold. The pellet was washed three times with 

30 HPLC grade ethanol (Scharlau), before resuspension in HPLC grade ethanol containing 0.5 

mg/mL of polyvinylpyrrolidone (PVP) (Bio-Rad). The gold/plasmid suspension was then 

coated onto Tefzel tubing and 0.5 inch cartridges prepared. 
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Gene Gun Immunization of Mice 
[0198] Groups of 8 female C57BL6/J (6-8 weeks old) (ARC, WA or Monash 
Animal Services, VIC) were immunized on Day 0, Day21, Day 42 and Day 63 with the 
relevant DNA. The day before each immunization the abdomen of each mouse was shaved 
5 and depilatory cream (Nair) applied for 1 minute. DNA was delivered with the Helios gene 
gun (Biorad) using a pressure of 400psi. Mice were given 2 shots on either side of the 
abdomen, with 1 |iig of DNA delivered per shot. Serum was collected via intra-ocular bleed 2 
days prior to initial immunization and 2 weeks after each subsequent immunization (Day 2, 
Day 35, Day 56 and Day 77). 

10 ELISA to measure E7 immune response 

[0199] Nine peptides spanning the full-length of HPV 1 6E7 (Frazer et al , 1 995) 
were used to measure the E7 antibody response. The peptides were synthesised and purified to 
>70% purity by Auspep (Melbourne). Peptides GF101 to 106 and GF108 to 109 described in 
Frazer et al were made. Note that instead of GF107, GF107a was used: 

1 5 HYNIVTFCCKCDSTLRL. 

[0200] GF1 02 D13G, GF1 03 D5G/C8G/E1 0G and GF104E2G peptides, named 
GF102n, GF103n and GF104n respectively, were also synthesised. These peptides were used 
for the ELISA when measuring antibodies to non-oncogenic E7 i.e. these peptides incorporate 
the mutations that were made to make the E7 protein non-oncogenic. 

20 [0201] Microtiter plates were coated overnight with 50 jllL of lO^g/mL E7 peptide 

per well. After coating, microtiter plates (Maxisorp, Nunc) were washed two times with 
PBS/0.05% Tween 20 (PBS-T) and then blocked for two hours at 37°C with 100 \\L of 5% 
skim milk powder in PBS-T. After blocking, plates were washed three times with PBS-T and 
50 \xL of mouse sera at a dilution of 1 in 100 was added for 2 hours at 37°C. All serum was 

25 assayed in duplicate wells. Plates were then washed three times with PBS-T and 50 juL of 
sheep anti-mouse IgG horseradish peroxidise conjugate (Sigma) was added at a 1 in 1000 
dilution. After 1 hour plates were washed and 50 \\L of OPD substrate was added. Absorbance 
was measured after 30 min and the addition of 25 pL of 2.5 M HC1 at 490 nm in a Multiskan 
EX plate reader (Pathtech). Note controls were included: control primary antibody for a 

30 positive control, secondary antibody only, and day 0 serum/ serum from unimmunized mice 
as negative controls. 
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[0202] The immune response preferences of codons determined from these 
experiments are tabulated in TABLE 1 . 

EXAMPLE 2 

Construction of Codon Modified Influenza A Virus (H5N1) HA DNA For 



[0203] The wild-type nucleotide sequence of the influenza A virus, HA gene for 
hemagglutinin (A/Hong Kong/21 3/03 (H5N1), MDCK isolate, embryonated chicken egg 
isolate) is shown in SEQ ID NO: 50 and encodes the amino acid sequence shown in SEQ ID 
NO: 51. Several codons within that sequence were mutated using the method described in 
10 Example 1. Specifically, the method involved replacing codons of the wild type nucleotide 
sequence with corresponding synonymous codons having higher immune response 
preferences than the codons they replaced, as represented in Table 1 . An illustrative codon 
modified nucleotide sequence comprising high immune response preference codons is shown 
in SEQ ID NO: 52. 



Construction of Codon Modified Influenza A Virus (H3N1) DNA For 
Conferring an Enhanced Immune Response to H3N1 HA 

[0204] The wild-type nucleotide sequence of the influenza A virus, HA gene for 
hemagglutinin (A/swine/Korea/PZ72-l/2006(H3Nl)) is shown in SEQ ID NO: 53 and 
20 encodes the amino acid sequence shown in SEQ ID NO: 54. Specifically, the method 
involved replacing codons of the wild type nucleotide sequence with corresponding 
synonymous codons having higher immune response preferences than the codons they 
replaced, as represented in Table 1. An illustrative codon modified nucleotide sequence 
comprising high immune response preference codons is shown in SEQ ID NO: 55. 

25 EXAMPLE 4 

Construction of Codon Modified Influenza A Virus (H5N1) NA DNA For 
Conferring an Enhanced Immune Response to H5N1 NA 

[0205] The wild-type nucleotide sequence of the influenza A virus, NA gene for 
neuraminidase (A/Hong Kong/213/03(H5N1), NA gene neuraminidase, MDCK isolate, 
30 embryonated chicken egg isolate) is shown in SEQ ID NO: 56 and encodes the amino acid 



5 



Conferring an Enhanced Immune Response to H5N1 HA 



15 



EXAMPLE 3 
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sequence shown in SEQ ID NO: 57. Several codons within that sequence were mutated using 
the method described in Example 1. Specifically, the method involved replacing codons of the 
wild type nucleotide sequence with corresponding synonymous codons having higher immune 
response preferences than the codons they replaced, as represented in Table 1. An illustrative 
5 codon modified nucleotide sequence comprising high immune response preference codons is 
shown in SEQ ID NO: 58. 

EXAMPLE 5 

Construction of Codon Modified Influenza A Virus (H3N1) NA DNA For 
Conferring an Enhanced Immune response to H3N1 NA 

10 [0206] The wild-type nucleotide sequence of the influenza A virus, NA gene for 

neuraminidase (A/swine/MI/PU243/04(H3Nl)) is shown in SEQ ID NO: 59 and encodes the 
amino acid sequence shown in SEQ ID NO: 60. Several codons within that sequence were 
mutated using the method described in Example 1 . Specifically, the method involved 
replacing codons of the wild type nucleotide sequence with corresponding synonymous 

15 codons having higher immune response preferences than the codons they replaced, as 

represented in Table 1. An illustrative codon modified nucleotide sequence comprising high 
immune response preference codons is shown in SEQ ID NO: 61. 

EXAMPLE 6 

Construction of Codon Modified Hepatitis C Virus El (1 AH77) DNA For 
20 Conferring an Enhanced Immune Response to HCV El (1 AH77) 

[0207] The wild-type nucleotide sequence of the hepatitis C Virus El, (serotype 
1 A, isolate H77 ? from polyprotein nucleotide sequence AF009606) is shown in SEQ ID NO: 
62 and encodes the amino acid sequence (NP 751920) shown in SEQ ID NO: 63. Several 
codons within that sequence were mutated using the method described in Example 1 . 
25 Specifically, the method involved replacing codons of the wild type nucleotide sequence with 
corresponding synonymous codons having higher immune response preferences than the 
codons they replaced, as represented in Table 1 . An illustrative codon modified nucleotide 
sequence comprising high immune response preference codons is shown in SEQ ID NO: 64. 
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EXAMPLE 7 

Construction of Codon Modified Hepatitis C Virus E2 (1 AH77) DNA For 
Conferring an Enhanced Immune Response to HCV E2 (1 AH77) 
[0208] The wild-type nucleotide sequence of the hepatitis C Virus E2, (serotype 
1 A, isolate H77, from polyprotein nucleotide sequence AF009606) is shown in SEQ ID NO: 
65 and encodes the amino acid sequence (NP 751921) shown in SEQ ID NO: 66. Several 
codons within that sequence were mutated using the method described in Example 1 . 
Specifically, the method involved replacing codons of the wild type nucleotide sequence with 
corresponding synonymous codons having higher immune response preferences than the 
codons they replaced, as represented in Table 1 . An illustrative codon modified nucleotide 
sequence comprising high immune response preference codons is shown in in SEQ ID NO: 
67. 

EXAMPLE 8 

Construction of Codon Modified Epstein - Barr Virus Type 1 gp350 DNA For 
Conferring an Enhanced Immune Response to EBV Type 1 gp350 
[0209] The wild-type nucleotide sequence of the Epstein - Barr virus, EBV type 1 
gp350 (Gene BLLF1, strand 77142-79865) is shown in SEQ ID NO: 68 and encodes amino 
acid sequence (CAD53417) shown in SEQ ID NO: 69. Several codons within that sequence 
were mutated using the method described in Example 1. Specifically, the method involved 
replacing codons of the wild type nucleotide sequence with corresponding synonymous 
codons having higher immune response preferences than the codons they replaced, as 
represented in Table 1 . An illustrative codon modified nucleotide sequence comprising high 
immune response preference codons is shown in SEQ ID NO: 70. 

EXAMPLE 9 

Construction of Codon Modified Epstein - Barr Virus Type 2 gp350 DNA For 
Conferring an Enhanced Immune Response to EBV Type 2 gp350 

[0210] The wild-type nucleotide sequence of the Epstein - Barr virus, EBV type 2 
gp350 (Gene BLLF1, strand 77267-29936) is shown in SEQ ID NO: 71 and encodes the 
amino acid sequence (YP 001 129462) shown in SEQ ID NO: 72. Several codons within that 
sequence were mutated using the method described in Example 1 . Specifically, the method 
involved replacing codons of the wild type nucleotide sequence with corresponding 
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synonymous codons having higher immune response preferences than the codons they 
replaced, as represented in Table 1. An illustrative codon modified nucleotide sequence 
comprising high immune response preference codons is shown in SEQ ID NO: 73. 

EXAMPLE 10 

5 Construction of Codon Modified Herpes Simplex Virus 2 Glycoprotein B DNA 
For Conferring an Enhanced Immune Response to HSV-2 Glycoprotein B 
[0211] The wild-type nucleotide sequence of the Herpes Simplex virus 2, 
glycoprotein B strain HG52 (genome strain NC 001798) is shown in SEQ ID NO: 74 and 
encodes the amino acid sequence (CAB06752) shown in SEQ ID NO: 75. Several codons 

10 within that sequence were mutated using the method described in Example 1 . Specifically, the 
method involved replacing codons of the wild type nucleotide sequence with corresponding 
synonymous codons having higher immune response preferences than the codons they 
replaced, as represented in Table 1 . An illustrative codon modified nucleotide sequence 
comprising high immune response preference codons is shown in SEQ ID NO: 76. 

15 EXAMPLE 11 

Construction of Codon Modified Herpes Simplex Virus 2 Glycoprotein D DNA 
For Conferring an Enhanced Immune Response to HSV-2 Glycoprotein D 
[0212] The wild-type nucleotide sequence of the Herpes Simplex virus 2, 
glycoprotein D strain HG52 (genome strain NC 001798) is shown in SEQ ID NO: 77 and 
20 encodes the amino acid sequence (NP 044536) shown in SEQ ID NO: 78. Several codons 

within that sequence were mutated using the method described in Example 1. Specifically, the 
method involved replacing codons of the wild type nucleotide sequence with corresponding 
synonymous codons having higher immune response preferences than the codons they 
replaced, as represented in Table 1. An illustrative codon modified nucleotide sequence 
25 comprising high immune response preference codons is shown in SEQ ID NO: 79. 

EXAMPLE 12 
Optimised E7 and HSV-2 Constructs 

Desien and synthesis of optimal and least optimal E7 constructs 

[0213] One de-optimized (W) and three optimized (01-03) E7 constructs were 
30 designed and made using the codon preferences summarized in Table 1 ("the Immune 
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Coricode table"). The least favourable codons were used for construct W. For the first 
optimized construct, Ol, whose sequence is shown in SEQ ID NO: 81, all of the codons were 
modified to those codons determined most optimal. 02, whose sequence is shown in SEQ ID 
NO: 82, is an alternative optimized construct which involved changing all Ala to GCT; Arg 
5 CGG and AGG to CGA and AGA, respectively; Glu to GAA; Gly to GGA; He to ATC; all 
Leu to CTG; Phe to TTT, Pro to CCT or CCC, Ser to TCG, Thr to ACG; and all Val except 
GTG to GTC. The 02 modifications avoided, with the exception of Leu and He, changing 
codons to mammalian consensus-preferred codons. For 03, whose sequence is shown in SEQ 
ID NO: 83, only certain amino acids for which particularly distinct differences were observed 
10 between codons, and for which the optimal codon(s) was not also a mammalian consensus 
preferred codon, were modified. In particular, in 03 all non-preferred Gly, Leu, Pro, Ser and 
Thr codons were changed to GGA, CTC, CCT, TCG and ACG, respectively, and where a 
preferred codon was already used it was not altered. Codons for other amino acids in 03 were 
not modified. 

15 Humoral and cellular responses to biolistic immunization with the optimal and least 

optimal E7 constructs 

[0214] As may be seen in Figure 18 (a) all three optimized constructs (Ol to 03) 
gave rise to significantly larger antibody responses than the wild-type construct as measured 
by both the peptide ELIS A and a GST-E7 protein ELISA. The amplitudes of the response 

20 were not statistically different between the three optimized constructs. The de-optimized 
construct, W, whose sequence is shown in SEQ ID NO: 84, gave a very low antibody 
response, appearing slightly lower but not statistically different from the wild-type (wt) codon 
usage (CU) construct, whose sequence is shown in SEQ ID NO: 80. From the IFN-y 
ELISPOT experiments, a representative example of which is shown in Figure 1 8, it appears 

25 that the codon preferences for maximizing the antibody response are similar to those required 
for maximising the T cell response: the de-optimized construct W failed to give a measurable 
response in the IFN-y ELISPOT assay and two of the optimized constructs (02 and 03) gave 
statistically significantly larger responses than the wild-type CU construct. Over the three 
repeats the responses to 02 and OS were not statistically different from each other. 

30 Unexpectedly, and in contrast to the antibody trend, in two of the three repeat experiments Ol 
gave a similar cellular response to the wt CU construct, which was less than that achieved by 
the 02 or 03 constructs. 
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Humoral and cellular responses to immunization bv intradermal Injection with the 

optimal and least optimal E7 constructs 
[0215] The humoral and cellular responses of mice to the optimized, wild-type CU 
and de-optimized constructs delivered by intradermal injection were also measured and the 
5 results are summarized in Figure 19. In general, similar trends were observed for intradermal 
injection as for biolistic delivery. 

[0216] From the E7 protein ELISA, it is apparent that the three optimized 
constructs, 01-03, were all significantly better at generating antibodies than the wild-type 
construct and that the de-optimized construct gave a very low antibody response similar to 
10 wild-type. The optimized constructs all gave rise to significantly more spots in the IFN-y 

ELISPOT than the wild-type construct and the de-optimized construct failed to give rise to a 
measurable response. 

[0217] The amplitudes of the antibody responses to gene gun immunization were 
larger than that for the intradermally (ID) delivered vaccines, despite the ID immunization 
1 5 delivering more than five times the dose. 

Design and synthesis of optimal and least optimal HSV~2construct$ 

[0218] Three optimized (Ol-OS; whose sequences are shown in SEQ ID NO: 86- 
88, respectively) and a de-optimized construct (W; whose sequence is shown in SEQ ID NO: 
88) encoding full-length glycoprotein D from Herpes Simplex Virus 2 (gD2) were prepared. 
20 A control construct pCDNA3-gD2 with wt CU was also made. Wild-type CU, whose 
sequence is shown in SEQ ID NO: 85, is close to MC CU. 

Humoral responses to biolistic and intradermal immunization with the optimal and least 

optimal sD2 constructs 
[0219] C57B1/6 mice were immunized in two groups (8 mice/construct; used 
25 intradermal injection (ID) and gene gun delivery) using the same immunization protocol as 
for the E7 constructs. 

[0220] Group 1 included pCDNA3-gD2 and pCDNA3-gD2 Ol. Group 2 included 
pCDNA3-gD2, pCDNA3-gD2 02, pCDNA3-gD2 03, and pCDNA3-gD2 W. 

[0221] Antibody responses were measured by an ELISA using plates coated with 
30 CHO cell supernatant containing C -terminally His tagged and truncated gD2. The truncation 
is at amino acid residue 331 and removes the transmembrane region resulting in the protein 
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being secreted into the medium. Control ELISA plates coated with supernatant from CHO 
cells transfected with empty vector were used as a control. 

[0222] For both biolistic and intradermal injection delivery routes it was found that 
the three optimized constructs generated similar levels of antibodies as the wt CU gD2 
5 construct (Figure 20). The de-optimized construct, W gD2, was very poor at generating 
antibodies, particularly when delivered by intradermal injection. The two delivery methods 
resulted in similar levels of antibodies. 

[0223] To date, there are no DNA vaccines on the market for the treatment or 
prevention of disease in humans. There is a need to maximize the immune responses 
10 generated by DNA vaccines and the present invention discloses ways of enhancing efficacy of 
DNA vaccines by using codons that have a higher preference for producing an immune 
response. 

[0224] The study described in this Example has validated the Immune Coricode 
table by applying it to optimization or de-optimization of the HPV16 E7 and HSV-2 
1 5 glycoprotein D (gD2) genes and demonstrating that this does enhance or reduce, respectively, 
the antibody or cellular response to biolistic delivery of these genes to mammals such as mice. 

Material and Methods 

ELISPOT Assay 

[0225] For the IFN-y ELISPOTs, mice were immunized twice, at days 0 and 21 , 
20 and the spleens were collected 3 weeks after the second immunization. 

Intradermal Injection Protocol 
[0226] The timing and frequency of the immunizations by intradermal injection 
were the same as for gene gun immunization. At each immunization 5 |ag of DNA was 
injected per ear le. a total of 10 jxg was administered per immunization per mouse. Hair 
25 removal prior to immunization was not necessary. The timing of bleeds and spleen collection 
was the same as for the gene gun immunized mice. 

GST-E7 ELISA 

[0227] The GST-E7 ELISA was carried out in the same way as the peptide ELISA 
with the exception that the plates were coated overnight with 50 \xL of 10 |ng/mL GST-tagged 
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E7 protein (kindly provided by the Frazer group from the Diamantina Institute, The 
University of Queensland, Brisbane). 

HSV-2 gD ELISA 

[0228] This ELISA was carried out in the same way as the E7 ELISAs with the 
5 exception that the plates were coated with supernatant from CHO cells transfected with a 
vector encoding C-terminally His-tagged and truncated gD2 protein. Control plates coated 
with supernatant from CHO cells transfected with empty vector were also used. 

Detection of HPV-specific responses 
[0229] For the detection of HPV-specific responses, 96-well filter ELISPOT plates 

10 (Millipore) were coated overnight with 10 ixg/mL HPV GST-tagged E7 protein in 0.1 M 
NaHC0 3 . For the detection of total IgG secreting cells, 96-well filter ELISPOT plates were 
coated overnight with 2 j^g/mL goat anti-mouse Ig (Sigma) in PBS without MgCl 2 and CaCl?. 
After coating, plates were washed once with complete DMEM without FCS and then blocked 
with complete DMEM supplemented with 10% FCS for one hour at 37° C. Cultured mouse 

15 spleen cells were washed and added to ELISPOT plates at 10 6 cells/100 \xL. For the detection 
of HPV-specific memory B cells, plates were incubated overnight at 37° C and for measuring 
total IgG cells, plates were incubated for 1 hour at 37° C. For detection, we used biotinylated 
goat anti-mouse IgG (Sigma) in PBS-T/1% FCS, followed by 5 jug/mL HRP-conjugated 
avidin (Pierce) and developed using 3-amino-9-ethylcarbozole (Sigma). Developed plates 

20 were counted using an automated ELISPOT plate counter. 

E7 IFN-y ELISPOT 

[0230] 96-well filter plates (Millipore) were coated overnight with 4 \xg/mL of 
monoclonal antibody (AN18; Mabtech). After coating, plates were washed once with 
complete RPMI and blocked for 2 hours with complete RPMI with 10% foetal calf serum 

25 (FCS; CSL Ltd). Mouse spleens were made into single cell suspensions and treated with ACK 
lysis buffer, washed and resuspended at a concentration of 10 7 cells/mL. Spleen cells 
(10 6 /well) were added to each well followed by the addition of complete RPMI supplemented 
with recombinant hIL-2 (ProSpec-Tany TechnoGene Ltd) and peptide to a final concentration 
of lOIU/well and 1 |ug/mL, respectively. Medium containing hIL-2 without peptide was added 

30 to control wells. Plates were incubated for approximately 18 hours at 37° C in 5-8% CO2. 
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[0231] After overnight incubation, cells were lysed by rinsing the plates in tap 
water and then washed six times in PBS/0.05% Tween 20 (PBS-T). For detection, 
biotinylated detection mAb (R4-6A2; Mabtech) in PBS-T/2% FCS was added, followed by 
horse radish peroxidase (HRP)-conjugated strepavidin and DAB (Sigma). Developed plates 
5 were counted using an automated ELISPOT plate counter. 

[0232] The disclosure of every patent, patent application, and publication cited 
herein is hereby incorporated herein by reference in its entirety. 

[0233] The citation of any reference herein should not be construed as an admission 
that such reference is available as "Prior Art" to the instant application. 

1 0 [0234] Throughout the specification the aim has been to describe the preferred 

embodiments of the invention without limiting the invention to any one embodiment or 
specific collection of features. Those of skill in the art will therefore appreciate that, in light 
of the instant disclosure, various modifications and changes can be made in the particular 
embodiments exemplified without departing from the scope of the present invention. All such 

1 5 modifications and changes are intended to be included within the scope of the appended 
claims. 
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WHAT IS CLAIMED IS: 

1. A method for constructing a synthetic polynucleotide from which a 
polypeptide is producible to confer an immune response to a target antigen in a mammal 
in a different quality than that conferred by a parent polynucleotide that encodes the 
5 same polypeptide, wherein the polypeptide corresponds to at least a portion of the target 
antigen, the method comprising: (a) selecting a first codon of the parent polynucleotide 
for replacement with a synonymous codon, wherein the synonymous codon is selected 
on the basis that it exhibits a different preference for conferring an immune response 
("an immune response preference") than the first codon in a comparison of immune 
10 response preferences; and (b) replacing the first codon with the synonymous codon to 
construct the synthetic polynucleotide, wherein the comparison of immune response 
preferences of the codons is represented by TABLE 1 : 



TABLE 1 



Amino Acid 


Ranking of Immune Response Preferences for Synonymous 

Codons 


Ala 


Ala GCT > Ala GCC > (Ala GCA , Ala 000 ) 


Arg 


(Arg CGA , Arg CGC , Arg CGT , Arg AGA ) > (Arg AGG , Arg CGG ) 


Asn 


Asn^ 0 > Asn^ 1 " 


Asp 


Asp GAC > Asp GAT 


Cys 


Cys TGC >Cys TGT 


Glu 


Glu GAA >Glu GAG 


Gin 


Gln CAA = Gln CAG 


Gly 


Gly GGA > (Gly GGG Gly GGT , Gly GGC ) 


His 


His CAC = His CAT 


He 


Ile ATC » Ile ATT >Ile ATA 


Leu 


(Leu CTG Leu CTC ) > (Leu CTA , Leu CTT ) » Leu 71 " 6 > Leu™ 


Lys 


Lys^ 0 = Lys^ 


Phe 


1 Phe rrT >Phe TTC 
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TV 

Pro 


t* CCC -«w t» CCT «^ /Tk CCA t» CCG\ 

Pro > Pro » (Pro , Pro ) 


Ser 


o TCG /C i TCT o TCA e TCCn ^ AGC o~-AG T\ 

Ser » (Ser 5 Ser , Ser ) » (Ser , Ser^ ) 


Thr 


nni ACG t»i ACC ^ t>\ AC A — t^i. ACT 

Thr > Thr » Thr > Thr 


Tvr 
lyr 


Tvr TAC > Tvr TAT 


Val 


(Val GTG Val GTC ) > Val GTT > Val GTA 







2. The method according to claim 1, wherein the synthetic polynucleotide 
confers a stronger or enhanced immune response than the parent polynucleotide under 
the same conditions and wherein the first and synonymous codons are selected from 
TABLE 2: 



TABLE 2 



First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Ala 000 


Ala 007 


IIe ATA 


Ile ATC 


Ser^ 57 


Ser 7CG 


Ala 000 


Ala 000 


Ile ATA 


Ile ATT 


Ser AGT 


Ser 7CT 


Ala^ 


Ala 007 


Ile A7T 


Ile ATC 


Ser^ 7 


Ser 7CA 


Ala GCA 


Ala 000 






Ser^ 7 


Ser 7CC 


Ala 000 


Ala 007 


Leu TTA 


Leu CTG 


Ser AGC 


Ser 7CG 






Leu 7 ™ 


Leu CTC 


Ser^ 


Ser TCT 


Arg CGG 


Arg CGA 


Leu 7 ™ 


Leu CTA 


Ser AGC 


Ser 7CA 


Arg CGG 


Arg CGC 


Leu™ 


Leu CTT 


Ser^ 0 


Ser 7CC 


Arg CGG 


Arg CGT 


Leu™ 


Leu™ 


Ser 7CC 


Ser TCG 


Arg CGG 


ArgAOA 


Leu TTG 


Leu CTG 


Ser 7CA 


Ser 7CG 


Arg AGG 


Arg CGA 


Leu 71 " 0 


Leu CTC 


Ser TCT 


Ser 7CG 


Arg AGG 


Arg CGC 


Leu TTG 


Leu CTA 






Arg AGG 


Arg CGT 


Leu TTG 


Leu CTT 


Thr Acr 


Thr ACG 
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ago 
Arg AGG 


AHA 

Arg 


Leu c " 


t CTG 


rr>1 ACT 

Thr ^ 


r-T~l-l ACC 

Thr 






t CTT 

Leu~" 


T CTC i 

Leu c 


ACT 

Thr 


<t>i ACA 

Thr 


A AT 


A AC 

Asn^ 0 


Leu CIA 


T CTG 

Leu ° 


t^i ACA 

Thr 


rr<i ACG 

Thr 






Leir 1A 


T CTC 

Leu^ lc 


ACA 

Thr 


ACC 

Thr 


Asp GAT 


GAC 

Asp OAC 






rpi ACC 

Thr 


Ti—ACG 

Thr 






^ TIC 

Phe" c 


t*i TTT 

Phe 






Cys TGT 


_ TOP 

Cys TGC 






rp TAT 

Tyr 


T^, TAC 

lyr 






t> CCG 

Pro cco 


CCC 

Pro 






n An 

Glu GAG 


^ _ HA A 

Glu GAA 1 


TT> CCG 


™ CCT 

Pro 


T r iGTA 

Val 


Val 






_ CCA 

Pro CLA 


tt, CCC 

Pro 


A r iGTA 

Val 


Val 


Gly 


Gly 0 ^ 


CCA 


T-k CCT 

Pro 


w^iGTA 

Val 


A/o iGTT 

Val 




rilv GGA 


Pro CCT 


Pro ccc 


Val GTT 


Val GTG 


Gly GGG 


Gly GGA 






Val GTT 


Val GTC 















3. The method according to claim 1, wherein the synthetic polynucleotide 
confers a stronger or enhanced immune response than the parent polynucleotide under 
the same conditions and wherein the first and synonymous codons are selected from 
5 TABLE 3: 

TABLE 3 



First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Ala GCG 


Ala 007 


Leu™ 


Leu CTA 


Ser AGT 


Ser TCG 


Ala GCA 


Ala GCT 


Leu™ 


Leu CTT 


Ser AGT 


Ser TCT 


Ala GCC 


Ala GCT 


Leu™ 


Leu™ 


Ser AGT 


Ser TCA 






Leu TTG 


Leu CTA 


Ser AGC 


Ser TCG 


Arg CGG 


Arg CGA 


Leu™ 


Leu CTr 


Ser AGC 


Ser TCT 
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can 
Arg CGG 


OCYT 

Arg 






Ser AOC 


n TCA 

Ser 1L - A 


can 

Arg 


Arg AGA 


_ , ttc 

Phe^ 0 


Phe ,TT 


Ser^ 


r>i TCC 


a nr. 

Arg 


rv; a 

Arg CGA | 






r-t TCC 

Ser lcc 


« TCG 


Ann 

Arg AGG 


Arg 


Pro cco 


t-v CCT 


01 TCA 

Ser IWV 


o TCG 

Ser'^ u 


Ann 

Arg 


Arg AGA 


^ CCA 


„ CCT 


n TCT 

Ser ^ 


o TCG 

Ser 














Glu GAG 


„ n a a 

Glu GAA 






Thr ACT 


rw-»l ACG i 

Thr 










1 -T-M ACT 

Thr 


i-T-rf ACA 

Thr 


Gly QGC 


nn a 

Gly 






rx^-| AC A 

Thr 


nri ACG 

Thr 


Gly 007 


Gly 1 






t->i ACC 

Thr 


nr"i_ ACG 

Thr 




vjiy 


















Val GTA 


Val GTT 















4. The method according to claim 3, further comprising selecting a second 
codon of the parent polynucleotide for replacement with a synonymous codon, wherein 
the synonymous codon is selected on the basis that it exhibits a higher immune response 
5 preference than the second codon in a comparison of immune response preferences; and 
(b) replacing the second codon with the synonymous codon, wherein the comparison of 
immune response preferences of the codons is represented by TABLE 4: 



TABLE 4 



Second 
Codon 


Synonymous 
Codon 


Second 
Codon 


Synonymous 
Codon 


Second 
Codon 


Synonymou 
s Codon 


Ala 000 


Ala GCT 


Ile ATA 


Ile ATC 


Ser^ 1 " 


Ser TCG 


Ala 000 


Ala GCC 


Ile ATA 


Ile ATT 


Ser^ 1 


Ser TCT 


Ala GCA 


Ala GCT 


Ile ATT 


Ile ATC 


Ser^ 7 


Ser TCA 


Ala 00 ^ 


Ala 000 






Ser AGT 


Ser TCC 
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Ala GCC 


Ala GCT 


Leu™ 


Leu CTG 


Ser^ 0 


Ser TCG 






Leu™ 


Leu CTC 


Set** 3 


Ser TCT 


Arg CGG 


Arg CGA 


Leu TTA 


Leu CTA 


Ser^ 0 


Ser TCA 


Arg CGG 


Arg CGC 


Leu TTA 


Leu CTT 


Se^ 00 


Ser TCC 


a CGG 

Arg 


Arg CGT 


Leu TTA 


Leu TTO 


Ser TCC 


Ser TCG 


Arg CGG 


Arg AGA 


Leu" 0 


Leu CTG 


Ser TCA 


Ser TCG 


Arg AGG 


Arg CGA 


Leu" 6 


Leu CTC 


Ser TCT 


Ser TCG 


Arg AGG 


Arg CGC 


Leu" 0 


Leu CTA 






Arg AGG 


Arg CGT 


Leu™ 


Leu CTT 


Thr ACT 


Thr ACG 


ArgAGG 


Arg AGA 


Leu CTT 


Leu CTO 


Thr ACT 


Thr Acc 






Leu CTT 


Leu CTC 


Thr ACT 


Thr ACA 


Asn^ 7 


Asn^ 0 


Leu CTA 


Leu CTG 


Thr ACA 


Thr ACG 






Leu CTA 


Leu CTC 


Thr ACA 


Thr ACC 


Asp GAT 


Asp GAC 






Thr ACC 


Thr ACG 






Phe" 0 


Phe" 1 " 






Cys TGT 


Cys TGC 






Tyr TAT 


Tyr TAC \ 






Pro CCG 


Pro ccc 






Glu GAG 


Glu GAA 


Pro CCG 


Pro CCT 


Val GTA 


Val GTG 






Pro CCA 


Pro ccc 


Val GTA 


Val GTC 


Gly GGC 


Gly GGA 


Pro CCA 


Pro CCT 


Val GTA 


Val Grr 


Gly GGT 


Gly GGA 


Pro CCT 


Pro ccc 


Val GTT 


Val GTG 


Gly GGG 


Gly GGA 






Val GTT 


Val GTC 















5. The method according to claim 1, wherein the synthetic polynucleotide 
confers a weaker or reduced immune response than the parent polynucleotide under the 
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same conditions and wherein the first and synonymous codons are selected from the 
TABLE 5: 

TABLE 5 



First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Ala 007 


Ala 000 


Ile ATC 


Ile ATA 


Ser 7CG 


Ser 7CT 


Ala 001 


Ala GCA 


Ile ATC 


Ile A7T 


Ser 7CG 


Ser 7CA 


Ala 007 


Ala GCC 


Ile ATT 


Ile ATA 


Ser 7CG 


Ser 7CC 


Ala 000 


Ala 000 






Ser 7CG 


Ser^ 0 


Ala 000 


Ala 00 * 


Leu CTG 


Leu CTA 


Ser 7CG 


Ser AGT 






Leu CTG 


Leu CTT 


Ser 7CT 


Ser* 00 


Arg CGA 


Arg AGG 


Leu CTG 


Leu 770 


Ser 7CT 


Ser* 37 


Arg CGA 


Arg CGG 


Leu CTG 


Leu 77A 


Ser 7CA 


Ser^ 0 




Arg AGG ! 


Leu CTC 


Leu C7A 


Ser 7CA 


Ser AGT 




Arg CGG 


Leu CTC 


Leu C7T 


Ser 700 


Ser AGC 




Arg AGG 


Leu CTC 


Leu 770 


Ser 7cc 


Ser AGT 




Arg CGG 


Leu CTC 


Leu 77A 






Are AGA 


Arg AGG 


Leu CTA 


Leu 77G 


Thr Aco 


Thr ACC 


Are AGA 


Arg CGG 


Leu CTA 


Leu™ 


Thr ACG 


Thr ACA 






Leu CTT 


Leu 770 


Thr ACG 


Thr ACT 


Asn^ 0 


Asn^ 7 


Leu CTr 


Leu™ 


Thr ACC 


Thr ACA 






Leu 770 


Leu™ 


Thr Acc 


Thr ACT 


Asp GAC 


Asp GAT 






Thr ACA 


Xhr ACT 






Phe 777 


Phe 770 






Cys TGC 


Cys TGT 






Tyr 7AC 


Tyr 7AT 






Pro ccc 


Pro CCT 
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Glu GAA 


« n An 

Glu GAG 


Pro ccc 


p ro CCA 


Val GTG 


Val GTT 






Pro ccc 


Pro CCG 


Val GTG 


. PIT A 

Val GTA 


Gly GGA 


. nr.r 

Gly GGC 


Pro CCT 


Pro CCA 


Val GTC 


Val GTT 


pi GGA 




r>„-CCT 

rro 


rro 


Val 


\/ 0 iGTA 

val 


Gly GGA 


Gly GGG 






Val GTT 


VaI GTA 















6. The method according to claim 1, wherein the synthetic polynucleotide 
confers a weaker or reduced immune response than the parent polynucleotide under the 
same conditions and wherein the first and synonymous codons are selected from 
TABLE 6: 

TABLE 6 



First Codon 

A Hot V^UUVJH 


Svnonvmous 
Codon 


First Codon 


Synonymous 
Codon 


First Codon 


Synonymous 
Codon 


Ala 007 


Ala 000 


Leu CTA 


Leu TTG 


Ser TCG 


Ser TCT 


Ala GCT 


Ala GCA 


Leu CTA 


Leu TTA 


Ser TCG 


Ser TCA 


Ala GCT 


Ala GCC 


Leu CTr 


Leu™ 


Ser TCG 


Ser TCC 






Leu CTr 


Leu™ 


Ser TCG 


Ser AGC 


Arg CGA 


Arg AGG 


Leu 170 


Leu™ 


Ser TCG 


Ser AGT 


Arg CGA 


Arg CGG 






Ser TCT 


Ser* 00 


Arg CGT 


Arg AGG 


Phe™ 


Phe™ 


Ser TCT 


Ser AGT 


Arg CGT 


Arg CGG 






Ser TCA 


Ser AGC 


Arg AGA 


Arg AGG 


Pro CCT 


p ro CCA 


Ser TCA 


Ser* 01 


Arg AGA 


Arg CGG 


Pro CCT 


Pro CCG 


Ser TCC 


Ser AGC 














Glu GAA 


Glu GAG 






Thr Aco 


Thr ACC 










Thr ACO 


Thr ACA 
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First Codon 


Synonymous 
Coaon 


First Codon 


Synonymous 
Coaon 


First Codon 


Synonymous 
Coaon 


GGA 

Gly°° 


i GGC 

Gly uu 






ACG 

Thr 


rr-.1 APT 

Thr AL1 


GGA 


GGT 

Gly u 






rx--t ACA 

Thr 


T-i ACT 




r^u 7 GGG 

vjiy 


















Val GTT 


Val GTA 















7. The method according to claim 6, further comprising selecting a second 
codon of the parent polynucleotide for replacement with a synonymous codon, wherein 
the synonymous codon is selected on the basis that it exhibits a lower immune response 
5 preference than the second codon in a comparison of immune response preferences; 
and; (b) replacing the second codon with the synonymous codon, wherein the 
comparison of immune response preferences of the codons is represented by TABLE 7: 



TABLE 7 



Second 
Codon 


Synonymous 
Codon ! 


Second 
Codon 


Synonymous 
Codon 


Second 
Codon 


Synonymous 
Codon 


Ala GCT 


Ala GCG 


Ile ATC 


Ile ATA 


Ser TCG 


Ser TCT 


Ala GCT 


Ala 0 " 


Ile ATC 


Ile ATT 


Ser TCG 


Ser TCA 


Ala GCT 


Ala GCC 


Ile ATT 


Ile ATA 


Ser TCG 


Ser TCC 


Ala 000 


Ala 000 






Ser TCG 


Ser AGC 


Ala 000 


Ala 00 * 


Leu CTG 


Leu CTA 


Ser TCG 


Ser^ 7 






Leu CTG 


Leu C7T 


Ser TCT 


Ser AGC 


Arg CGA 


Arg AGO 


Leu CTG 


Leu 770 


Ser TCT 


Ser^ 7 


Arg CGA 


Arg CGG 


Leu CTG 


Leu™ 


Ser TCA 


Ser^ 0 


Arg CGC 


Arg AGG 


Leu CTC 


Leu CTA 


I Ser TCA 


Ser^ 7 


Arg CGC 


Arg CGG 


Leu CTC 


Leu CTT 


Ser TCC 


Ser AGC 
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Arg CGT 


Arg AGG 


Leu CTC 


Leu TTG 


' Ser TCC 


Ska** 


Arg CGT 


Arg CGG 


Leu CTC 


Leu TTA ■ 






Arg AGA 


ArgAGG 


Leu CTA 


Leu TTG | 


Thr ACG 


Thr ACC 


Arg AGA 


Arg CGG 


Leu CTA 


Leu TTA 


Thr ACG 


Thr ACA 






Leu CTT 


Leu TTG j 


Thr ACG 


Thr ACT 


Asn^ 0 


Asn^ 


Leu CTT 


Leu TTA 


Thr ACC 


Thr ACA 






Leu TTG 


Leu TTA 


Thr ACC 


Thr ACT 


Asp GAC 


Asp GAT 






Thr ACA 


Thr ACT 






Phe TTT 


Phe TTC 






Cys TGC 


Cys TGT 






Tyr TAC 


Tyr TAT 






Pro ccc 


Pro CCT 






Glu GAA 


Glu GAG 


Pro ccc 


p ro CCA 


Val GTG 


Val GTT 






Pro ccc 


Pro CCG 


Val GTG 


Val GTA 


Gly GGA 


Gly GGC 


Pro CCT 


p ro CCA 


Val GTC 


Val GTT 


Gly GGA 


Gly GGT 


Pro CCT 


Pro CCG 


Val GTC 


Val GTA 


Gly GGA 


Gly GGG 






Val GTT 


Val GTA 















8. A synthetic polynucleotide constructed according to any one of claims 1 to 

7. 

9. A chimeric construct that comprises a synthetic polynucleotide constructed 
5 according to claim 8, wherein the synthetic polynucleotide is operably connected to a 

regulatory polynucleotide. 

10. A pharmaceutical composition that is useful for modulating an immune 
response to a target antigen in a mammal, which response is conferred by the expression 
of a parent polynucleotide that encodes a polypeptide corresponding to at least a portion 

10 of the target antigen, the composition comprising a chimeric construct and a 
pharmaceutically acceptable excipient and/or carrier, wherein the chimeric construct 
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comprises a synthetic polynucleotide that is operably connected to a regulatory 
polynucleotide and that is distinguished from the parent polynucleotide by the 
replacement of a first codon in the parent polynucleotide with a synonymous codon that 
has a different immune response preference than the first codon and wherein the first 
5 and synonymous codons are selected according to any one of TABLES 2, 3, 5 and 6. 

11. The composition according to claim 10, further comprising an adjuvant that 
enhances the effectiveness of the immune response. 

12. The composition according to claim 10, which is formulated for 
transcutaneous administration. 

10 13. The composition according to claim 10, which is formulated for epidermal 

administration. 

14. The composition according to claim 10, which is formulated for dermal 
administration. 

15. The composition according to claim 10, which is formulated for intradermal 
15 administration. 

16. The composition according to claim 10, which is formulated for biolistic 
delivery. 

17. The composition according to claim 10, which is formulated for 
microneedle delivery. 

20 18. The composition according to claim 10, which is formulated for intradermal 

injection. 

19. The composition according to claim 10, wherein the synthetic 
polynucleotide confers a stronger or enhanced immune response than the parent 
polynucleotide under the same conditions and wherein the first and synonymous codons 

25 are selected according to TABLES 2 or 3. 

20. The composition according to claim 10, wherein the synthetic 
polynucleotide confers a weaker or reduced immune response than the parent 
polynucleotide under the same conditions and wherein the first and synonymous codons 
are selected according to TABLES 5 or 6. 

30 21. A method of modulating the quality of an immune response to a target 

antigen in a mammal, which response is conferred by the expression of a parent 
polynucleotide that encodes a polypeptide corresponding to at least a portion of the 
target antigen, the method comprising: introducing into the mammal a synthetic 
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polynucleotide that is operably connected to a regulatory polynucleotide and that is 
distinguished from the parent polynucleotide by the replacement of a first codon in the 
parent polynucleotide with a synonymous codon that has a different immune response 
preference than the first codon and wherein the first and synonymous codons are 
5 selected according to any one of TABLES 2, 3, 5 and 6. 

22. The method according to claim 21, wherein expression of the synthetic 
polynucleotide results in a different quality of immune response than the one obtained 
through expression of the parent polynucleotide under the same conditions. 

23. The method according to claim 21, wherein the chimeric construct is 
10 introduced into the mammal by delivering the construct to antigen-presenting cells of 

the mammal. 

24. The method according to claim 21 , wherein the chimeric construct is 
introduced into the dermis and/or epidermis of the mammal. 

25. The method according to claim 21 , wherein the chimeric construct is 
1 5 introduced into the mammal by transcutaneous administration. 

26. The method according to claim 21, wherein the chimeric construct is 
introduced into the mammal by epidermal administration. 

27. The method according to claim 21, wherein the chimeric construct is 
introduced into the mammal by intradermal administration. 

20 28. The method according to claim 21, wherein the chimeric construct is 

introduced into the dermis or epidermis of the abdomen. 

29. The method according to claim 21, wherein the immune response is selected 
from a cell-mediated response and a humoral immune response. 

30. The method according to claim 21, wherein the immune response is a 
25 humoral immune response. 

31. A method of enhancing the quality of an immune response to a target 
antigen in a mammal, which response is conferred by the expression of a parent 
polynucleotide that encodes a polypeptide corresponding to at least a portion of the 
target antigen, the method comprising: introducing into the mammal a chimeric 

30 construct comprising a synthetic polynucleotide that is operably connected to a 
regulatory polynucleotide and that is distinguished from the parent polynucleotide by 
the replacement of a first codon in the parent polynucleotide with a synonymous codon 
that has a higher immune response preference than the first codon, wherein the first and 
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synonymous codons are selected according to TABLES 2 or 3, whereby expression of 
the introduced synthetic polynucleotide confers a stronger or enhanced immune 
response than the one conferred by expression of the parent polynucleotide under the 
same conditions. 

5 32. A method of reducing the quality of an immune response to a target antigen 

in a mammal, which response is conferred by the expression of a parent polynucleotide 
that encodes a polypeptide corresponding to at least a portion of the target antigen, the 
method comprising: introducing into the mammal a chimeric construct comprising a 
synthetic polynucleotide that is operably connected to a regulatory polynucleotide and 

10 that is distinguished from the parent polynucleotide by the replacement of a first codon 
in the parent polynucleotide with a synonymous codon that has a lower immune 
response preference than the first codon, wherein the first and synonymous codons are 
selected according to TABLES 5 or 6, whereby expression of the introduced synthetic 
polynucleotide confers a weaker or reduced immune response than the one conferred by 

1 5 expression of the parent polynucleotide under the same conditions. 

33. A method of enhancing the quality of an immune response to a target 
antigen in a mammal, which response is conferred by the expression of a first 
polynucleotide that encodes a polypeptide corresponding to at least a portion of the 
target antigen, the method comprising: co-introducing into the mammal a first nucleic 

20 acid construct comprising the first polynucleotide in operable connection with a 
regulatory polynucleotide; and a second nucleic acid construct comprising a second 
polynucleotide that is operably connected to a regulatory polynucleotide and that 
encodes an iso-tRNA corresponding to a codon of the first polynucleotide, wherein the 
codon has a low or intermediate immune response preference and is selected from the 

25 group consisting of Ala GCA , Ala GCG , Ala GCC , Arg AGG Arg CGG Asn^, Asp GAT , Cys TGT , 
Glu GAG , Gly GGG , Gly GGT , Gly GGC , Ile ATA , Ile ATT , Leu TTG , Leu TTA , Leu CTA , Leu CTT , 
Phe TTC , Pro CCA , Pro CCG Pro CCT , Se^ 00 , Ser^, Ser TCT , Ser TCA , Ser TCC , Thr^, Thr ACT , 
Tyr TAT ,Val GTA andVal GTT . 

34. The method according to claim 33, wherein the codon has a 'low' immune 
30 response preference, and is selected from the group consisting of Ala GCA , Ala GCG , 

Arg AGG , Arg CGG Asn^, Asp GAT , Cys TGT , Glu GAG , Gly GGG Gly GGT , Gly GGC , Ile ATA , 
Leu TTG , Leu TTA , Phe TTC , Pro CCA , Pro CCG Sei^ 30 , Ser^ 1 , Thr ACT , Tyr TAT and Val GTA . 
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IgkCl GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

I gkS 1 - 1 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

I gkS 1-2 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkSl-3 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkSl-4 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

I gkCl CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1-1 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1 - 2 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1-3 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1-4 CCAGGTTCCACTGGTGACGGATC C ATGCATGG AGATACACCTAC ATTG CATGAATATATG 

IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

IgkCl TTAGATTTG CAAC CAGAG ACAACTGGTCTCTACGGTTATGGG C AATTAAATG ACAG CTC A 

I gkS 1-1 TTAGATTTG CAACCAG AG AC AACTGGTCTCTACGGTTATGGG C AATTAAATG ACAG CTCA 

I gkS 1 - 2 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTT ATGGGCAATTAAATG ACAG CTCA 

IgkSl - 3 TTAGATTTG C AACC AGAGACAACTGGTCTCTACGGTTATGGG CAATTAAATGACAG CTCA 

I gkS 1-4 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

IgkC2 CTGG ACCTGCAGCC CGAGACCACCGGCCTGTACGG CTACGGCCAG CTC AACGACAGCAGC 

181 191 201 211 221 231 

I gkC 1 GAGGAGG AGG ATG AAAT AG ATGGTC C AGCTGG ACAAGCAG AAC CGGACAG AG C C CATT AC 

I gkS 1 - 1 GAGGAGGAGG ATG AAATAG ATGGTC CAGCGGGACAAGCGG AAC CGGACAGAG CG CATT AC 

IgkSl-2 GAGGAGGAGG ATGAAATAG ATGGTC CAGCAGG ACAAGCAG AAC CGGACAGAG C ACATTAC 

I gkS 1 - 3 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCTGAACCGGACAGAGCTCATTAC 

I gkS 1-4 GAGGAGGAGGATGAAATAGATGGTCCAGCCGGACAAGCCGAACCGGACAGAGCCCATTAC 

IgkC2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

I gkC 1 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

I gkS 1 - 1 AATATTGTAAC CTTTTGTTGC AAGTGTGACTCTACGCTTCGGTTGTGCGTAC AAAG C ACA 

I gkS 1 - 2 AATATTGTAAC CTTTTGTTG C AAGTGTGACTCTACGCTTCGGTTGTGCGTAC AAAG C AC A 

I gkS 1 - 3 AATATTGTAAC CTTTTGTTGC AAGTGTGACTCTACGCTTCGGTTGTGCGTAC AAAG C AC A 

I gkS 1 - 4 AATATTGTAAC CTTTTGTTGCAAGTGTG ACTCTACGCTTCGGTTGTGCGTAC AAAG C AC A 

IgkC2 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

I gkC 1 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

I gkS 1-1 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

I gkS 1-2 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

I gkS 1 - 3 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

I gkS 1 - 4 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

IgkC2 CACGTGG AC ATC CGCAC C CTGG AGGAC CTGCTG ATGGG C AC C CTGGGCATCGTGTG C C CC 

361 371 381 

IgkCl ATCTGCTCTCAG AAG C C CTAAGAATTC 

I gkS 1-1 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1 - 2 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1 - 3 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1 - 4 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkC2 ATCTGCTCC CAGAAGC C CTAAGAATTC 
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IgJcS 1 - 5 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

I gkS 1 - 6 GGTACCGC CGCCAC CATGG AGACAG AC AC ACTC CTG CTATGGGTACTG CTG CTCTGGGTT 

I gkS 1-7 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkSl -8 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkSl - 9 GGTACCGCCGCCAC CATGG AGACAG ACAC ACTC CTGCTATGGGTACTG CTG CTCTGGGTT 

I gkS 1-10 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

I gkC 1 GGTAC CGCCGC CACCATGGAGACAGACAC ACTC CTGCTATGGGTACTG CTGCTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

IgkS 1-5 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

IgkS 1-6 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1-7 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

IgkSl-8 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

IgkS 1-9 C C AGGTTC C ACTGGTGACGG ATCCATGC ATGGAG ATAC ACCTAC ATTGCATG AAT ATATG 

I gkS 1 - 1 0 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

IgkCl C CAGGTTCCACTGGTGACGGATCCATGC ATGGAG ATACACCTACATTG C ATG AATATATG 

IgkC2 CCCGG CTCC AC CGGCGACGG ATC CATGC ACGG CGAC ACC C C CACCCTGCACGAGT AC ATG 

121 131 141 151 161 171 

I gkS 1-5 TTAG ATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

IgkSl - 6 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

I gkS 1 - 7 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

IgkSl - 8 TTAGATTTGCAACCAGAGACAACTGGTCT CTACGGTTATGGGCAATT AAATGAC AG CTC A 

I gkS 1-9 TTAGATTTGC AACC AGAGACAACTGGT CTCTACGGTTATGGG C AATTAAATG ACAG CTCA 

IgkSl- 10 TT AG ATTTGC AAC CAGAGACAACTGGTCTCTACGGTTATGGGC AATTAAATG AC AG CTCA 

I gkC 1 TTAGATTTGCAACC AGAGAC AACTGGTCTCT ACGGTTATGGG C AATTAAATGAC AG CTCA 

I gkC2 CTGGACCTGCAGCCCGAGACCACCGGCCTGTACGGCTACGGCCAGCTCAACGACAGCAGC 

181 191 201 211 221 231 

IgkSl - 5 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGGGCCCATTAC 

IgkSl- 6 GAGGAGGAGG ATGAAATAG ATGGTC C AG CTGGAC AAGCAG AAC CGGACAGAG C CC ATTAC 

IgkSl - 7 G AGG AGG AGGATG AAATAG ATGGTCCAGCTGG ACAAGC AGAACCGG AC CGGGC C C ATTAC 

IgkS 1 - 8 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACCGAGCCCATTAC 

IgkSl - 9 GAGGAGGAGG ATGAAATAG ATGGTC GAG CTGGAC AAGCAG AAC CGG AC CGTGC CC ATTAC 

IgkSl-10 G AGG AGG AGGATG AAATAG ATGGTCCAG CTGGACAAG C AGAAC CGG AC CG CGCC C ATTAC 

IgkCl GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkC 2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

IgkSl - 5 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTAGGTTGTGCGTACAAAGCACA 

I gkS 1-6 AATATTGTAACCITTTGTTGCAAGTGTGACTCTACGCTTAGATTGTGCGTACAAAGCACA 

IgkSl - 7 AAT ATTGTAAC CTTTTGTTG C AAGTGTG ACTCTACGCTTCGGTTGTGCGTAC AAAG C AC A 

IgkSl - 8 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGATTGTGCGTACAAAGCACA 

IgkSl - 9 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGTTTGTGCGTACAAAGCACA 

IgkSl - 10 AAT ATTGTAAC CTTTTGTTG C AAGTGTGACTCTACGCTTCG CTTGTGCGTAC AAAGCACA 

IgkCl AATATTGT AACCTTTTGTTGCAAGTGTG ACTCTACG CTTCGGTTGTG CGTAC AAAG C AC A 

IgkC2 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

IgkSl - 5 CACGTAGACATTAGGACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTG TGCCCC 

IgkSl- 6 C ACGTAG AC ATT AG AACTTTGG AAG AC CTGTTAATGGGC AC ACTAGG AATTGTGTGC C CC 

I gkS 1 - 7 C ACGTAG ACATTCGGACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTG C C C C 

IgkS 1-8 CACGTAGACATTCGAACTTTGGAAGAC CTGTTAATGGG CACACTAGGAATTGTGTGCCCC 

I gkS 1 - 9 C ACGTAG ACATTCGTACTTTGGAAGACCTGTTAATGGGCAC ACTAGG AATTGTGTG C C C C 

IgkSl - 10 C ACGTAG ACATTCGC ACTTTGGAAGACCTGTTAATGGG C AC ACTAGG AATTGTGTG CC C C 

IgkCl CACGTAGAC ATTCGT ACTTTGG AAGAC CTGTTAATGGG CACACTAGGAATTGTGTGCCCC 

IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkS 1 - 5 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkSl - 6 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1-7 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkSl - 8 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1-9 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkSl - 1 0 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC T^T/^TTT>T7 O 

IgkC2 ATCTG CTCCC AGAAG C C CT AAG AATTC JT'lOUxCll 2* 
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IgkCl GGTACCGCCGC CACCATGGAGAC AG ACACACTC CTGCTATGGGTACTGCTG CTCTGGGTT 

IgkSl- 12 GGTAC CG CCGCCACC ATGGAGAC AGAC AC ACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkSl - 3 1 GGTACCGC CGCC AC CATGG AG ACAG ACACACTC CTGCTATGGGTACTG CTGCTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

IgkCl CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 
I gkS 1 - 1 2 CC AGGTTC CACTGGTGACGG ATCCATG CATGG AGATACACCTACATTGCATGAATATATG 

I gkS 1 - 3 1 C CAGGTTCCACTGGTG ACGGATCCATGCATGGAGATACAC CT AC ATTG CATG AATATATG 

IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

I gkC 1 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
I gkS 1 - 1 2 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAACGACAGCTCA 
I gkS 1 - 3 1 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkC2 CTGGACCTGCAGCCCGAGACCACCGGCCTGTACGGCTACGGCCAGCTCAACGACAGCAGC 

181 191 201 211 221 231 

IgkCl GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkSl - 1 2 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAG CAGAACCGGACAGAGC C CATTAC 

I gkS 1 - 3 1 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkC2 GAGG AGGAGGACGAGATCG ACGGCC CCGC CGG C C AGG C CG AGCCCGAC CGCGC C C ACTAC 

241 251 261 271 281 291 

I gkC 1 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
I gkS 1-12 AACATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
I gkS 1 - 3 1 AATATTGTAACCTTTTGTTGCAAATGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkC2 AAC ATCGTG AC CTTCTGCTGC AAGTG CG AC AGC AC CCTGCGC CTCTG CGTG C AG AG CAC C 

301 311 321 331 341 351 

IgkCl CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
IgkSl - 12 CACGTAGACATTCGTACTTTGGAAG AC CTGTT AATGGG C ACACTAGGAATTGTGTG C C CC 

I gkS 1-31 CACGTAGACATTCGTACTTTGGAAG AC CTGTTAATGGGC AC ACTAGG AATTGTGTG CCC C 

IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTC AGAAG C C CTAAG AATT C 

I gkS 1 - 1 2 ATCTGCTCTCAGAAGCCCTAAGAATTC 
I gkS 1-31 ATCTGCTCTC AGAAAC C CTAAG AATTC 

IgkC2 ATCTGCTCCCAGAAGCCCTAAGAATTC 
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I gkC 1 GGT AC CGCCGC CACCATGGAGACAG ACAC ACTC CTG CT ATGGGTACTG CTG CTCTGGGTT 

IgkSl- 13 GGTACCGCCGCCACCATGGAGACAGATACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkSl - 14 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

IgkCl C C AGGTTCCACTGGTG ACGG ATC C ATGC ATGGAG AT AC ACCT AC ATTGCATGAATAT ATG 

IgkS 1 - 1 3 CCAGGTTCCACTGGTGATGGATCCATGCATGGAGATACAC CTAC ATTG CATG AATATATG 

I gkS 1-14 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGACACACCTACATTGCATGAATATATG 
IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

! g kc 1 TTAGATTTG C AACCAGAG ACAACTGGTCTCT ACGGTTATGGG CAATTAAATG AC AG CTCA 

IgkSl - 13 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGATAGCTCA 
IgkSl - 14 TTAGACTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
I gkC2 CTGG ACCTGCAG C CCG AGAC CAC CGGCCTGTACGGCT ACGGC CAGCTCAACGACAGCAG C 

181 191 201 211 221 231 

IgkCl G AGG AGGAGG ATG AAATAG ATGGTCCAG CTGG AC AAGC AGAAC CGG ACAGAG C C CATT AC 

IgkSl - 1 3 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGATAGAGCCCATTAC 
I gkS 1-14 G AGGAGG AGG ACGAAATAGACGGTC GAG CTGGAC AAGC AGAAC CGGAC AG AGCCCATTAC 

IgkC2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

IgkCl AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
I gkS 1 - 1 3 AAT ATTGTAACCTTTTGTTGC AAGTGTG ATT CTACG CTTCGGTTGTGCGTAC AAAG C ACA 

IgkSl- 14 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkC2 AACATCGTG ACCTTCTGCTGCAAGTGCGACAG CAC C CTGCGCCTCTG CGTG C AGAGC AC C 

301 311 321 331 341 351 

I gkCl CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
IgkSl - 13 CACGTAGATATTCGTACTTTGG AAGATCTGTTAATGGGC AC ACT AGG AATTGTGTG CC CC 

IgkSl- 14 C ACGTAG AC ATTCGTACTTTGGAAGAC CTGTTAATGGG CACACTAGG AATTGTGTGC C C C 

IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkS 1-13 ATCTGCTCTCAGAAGCCCTAAGAATTC 
I gkS 1 - 1 4 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkC2 ATCTGCTCCCAGAAGCCCTAAGAATTC 
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IgkCl GGTACCGCCG CCACCATGGAG ACAGACACACTCCTGCTATGGGTACTG CTGCTCTGGGTT 

IgkSl - 15 GGTACCGCCGCC AC CATGGAGAC AG ACACACTCCTGCTATGGGTACTG CTGCTCTGGGTT 

IgkS 1-16 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 HI 

IgkCl CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 
IgkSl-15 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 
IgkSl - 16 C C AGGTTC C ACTGGTG ACGGATC C ATGCATGGAGATACACCTAC ATTG C ATG AATAT ATG 

IgkC 2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkSl - 1 6 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkC2 CTGGACCTGCAGCCCGAGACCACCGGCCTGTACGGCTACGGCCAGCTCAACGACAGCAGC 

181 191 201 211 221 231 

IgkCl G AGGAGGAGGATG AAATAG ATGGTCCAGCTGG AC AAGC AG AAC CGG AC AGAGC C C ATTAC 

IgkSl - 15 G AGGAGGAGGATG AAATAGATGGTC CAGCTGGACAAG CAG AAC CGG AC AG AG C CC ATT AC 

IgkSl-16 GAGGAGGAGG ATG AAATAG ATGGTCCAG CTGG AC AAGC AG AACCGG AC AGAGCC CATTAC 

Igkc2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

IgkCl AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkSl- 15 AATATTGTAACCTTTTGTTGTAAGTGTGACTCTACGCTTCGGTTGTGTGTACAAAGCACA 
IgkSl - 1 6 AATATTGTAAC CTTTTG CTG CAAGTGCG ACTCTACG CTTCGGTTGTGCGTAC AAAGC ACA 

IgkC2 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

ICTkcl CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
IqkSl-15 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGTCCC 
IakSl - 1 6 C ACGTAGACATTCGTACTTTGG AAGAC CTGTTAATGGGC AC ACTAGGAATTGTGTGCC C C 

IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTC AG AAG CC CTAAG AATTC 

I gkS 1 - 1 5 ATCTGTTCTC AG AAGC CCTAAGAATTC 

IgkSl - 1 6 ATCTG CTCTC AG AAG CC CTAAG AATTC 

IgkC2 ATCTG CTC C CAG AAGC CCTAAGAATTC 
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IgkS 1-17 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
I gkS 1 - 1 8 GGTACCGCCGCCAC CATGG AAAC AGACAC ACT C CTG CTATGGGTACTG CTG CTCTGGGTT 

IgkC2 GGTACCG C CG CCACCATGGAG ACCG AC AC C CTCCTGCTGTGGGTG CTGCTGCTCTGGGTG 

I gkC 1 GGTAC CGC CGCC AC CATGG AG AC AGACACACTCCTG CTATGGGTACTG CTG CTCTGGGTT 

61 71 81 91 101 111 

IgkSl-17 C C AGGTTC CACTGGTGACGGATCC ATGC ATGGAG AT ACACCTAC ATTG CATG AGTATATG 

IgkSl - 18 CC AGGTTCCACTGGTGACGG ATC CATGCATGGAGATACAC CTACATTG C ATGAATATATG 

IgkC 2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 
IgkCl C C AGGTTC CACTGGTGACGGATCCATGC ATGGAG AT AC ACCTACATTGCATGAATATATG 

121 131 141 151 161 171 

IgkS 1-17 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkSl-18 TTAGATTTGCAACCAGAAACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkC2 CTGGACCTG CAG CCCG AG ACCACCGGCCTGTACGG CTACGGCC AG CTCAACGACAG CAGC 

IgkCl TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

181 191 201 211 221 231 

IgkSl-17 GAGGAGGAGGATGAGATAGATGGTCCAGCTGGACAAGCAGAGCCGGACAGAGCCCATTAC 
G AAG AAGAAGATG AAATAG ATGGTC CAGCTGG AC AAG C AG AAC CGG AC AG AGCCC ATT AC 
GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 
GAGGAGG AGG ATG AAATAGATGGT C CAG CTGG AC AAGC AGAAC CGG AC AGAGC C C ATT AC 



IgkSl-18 
IgkC2 
IgkCl 



241 251 261 271 281 291 

IgkSl-17 AAT ATTGTAACCTTTTGTTGC AAGTGTGACTCTACG CTTCGGTTGTG CGT AC AAAGC ACA 

I qkS 1-18 AATATTGTAAC CTTTTGTTGC AAGTGTGACTCTACG CTTCGGTTGTGCGTACAAAGCAC A 

IgkC2 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 
IgkCl AATATTGTAACCTTTTGTTG C AAGTGTG ACT CTACGCTTCGGTTGTGCGTACAAAGCAC A 

301 311 321 331 341 351 

I akS 1 - 1 7 CACGTAGACATTCGTACTTTGGAGG AC CTGTTAATGGGCAC ACTAGG AATTGTGTGC CCC 

lakSl - 1 8 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
IakC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 
IgkCl CACGT AG AC ATTCGTACTTTGG AAGACCTGTT AATGGG C ACACTAGG AATTGTGTGCC C C 

361 371 381 

I gkS 1 - 1 7 ATCTG CTCTC AG AAGC C CTAAGAATTC 

I gkS 1-18 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkC2 ATCTGCTC C CAG AAGC C CTAAGAATTC 

IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC 
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IgkCl GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkSl-19 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkSl -20 GGTACCGCCGCC AC C ATGG AGACAGAC AC ACTCCTGCTATGGGTACTG CTG CTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 . 81 91 101 111 

I gkC 1 C C AGGTTC C ACTGGTG ACGGATC CATGC ATGG AGATAC AC CTAC ATTGCATG AATATATG 

IgkSl- 19 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 
I gkS 1 - 2 0 C C AGGTTC C ACTGGTG ACGGATC CATGCATGGAGATACAC CTAC ATTGCATG AATATATG 

IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

IgkCl TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkS 1 - 1 9 TTAGATTTGCAGC CAGAGACAACTGGTCTCTACGGTTATGGG C AGTTAAATG AC AG CTC A 

I gkSl - 2 0 TT AG ATTTG C AACCAG AGACAACTGGTCTCT ACGGTTATGGGCAATTAAATGAC AG CTC A 

IgkC2 CTGGACCTGCAGC C CGAG ACCAC CGG C CTGTACGGCTACGG C C AG CTC AACG AC AGC AG C 

181 191 201 211 221 231 

I gkC 1 G AGG AGG AGG ATGAAATAGATGGTCCAGCTGGAC AAG C AG AACCGG ACAG AGC C CATTAC 

IgkSl- 19 GAGG AGGAGGATG AAATAG ATGGTC CAGCTGG ACAGGC AGAAC CGGACAGAG C CCATTAC 

IgkSl - 2 0 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkC2 GAGGAGGAGGACG AGATCGACGG C C CCG CCGG C CAGGCCGAG C C CGAC CGCGCC CACT AC 

241 251 261 271 281 291 

I gkc 1 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkSl - 1 9 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAGAGCACA 
IgkSl - 20 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
I gkC 2 AACATCGTG AC CTTCTG CTG C AAGTGCG AC AG C ACCCTGCG CCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

IgkCl CACGTAGACATTCGTACITTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
I gkS 1 - 1 9 C ACGTAG ACIATTCGT ACTTTGGAAGACCTGTT AATGGG C ACACTAGGAATTGTGTGCC CC 

IgkSl-20 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkS 1 - 1 9 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkSl - 2 0 ATCTGCTCTC AAAAGC C CTAAGAATTC 

IgkC2 ATCTGCTC C C AG AAG C CCT AAG AATTC 
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IgkCl GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkS 1-21 GGT AC CG CCGC C AC C ATGGAGAC AGAC ACACTC CTGCTATGGGTACTG CTGCTCTGGGTT 

I gkS 1-22 GGTAC CGCCG CCACCATGG AG AC AG AC AC ACTCCTG CT ATGGGTACTGCTGCT CTGGGTT 

IgkSl - 23 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

I gkS 1 - 2 4 GGTAC CGCCGCCACCATGGAGAC AG ACACACTCCTGCTATGGGTACTGCTG CTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

I gkC 1 C C AGGTTCCACTGGTG ACGGATCCATGC ATGG AGAT AC ACCT AC ATTG C ATGAATAT ATG 

I gkS 1-21 CCAGGGTCCACTGGGGACGGATCCATGCATGGGGATACACCTACATTGCATGAATATATG 

I gkS 1-22 CCAGGATCCACTGGAGACGGA rCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1-23 C C AGGTTC C ACTGGTG AC GGATCCATGC ATGGTG ATAC AC CTACATTGC ATG AAT ATATG 

I gkS 1 - 2 4 CCAGGCTCCACTGGCG AC GGA TCCATGC ATGGCGATAC AC CT AC ATTG CATGAATAT ATG 

IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

I gkC 1 TTAG ATTTGCAAC CAG AGACAACTGGTCTCTACGGTTATGGGCAATTAAATGAC AG CTC A 

IgkS 1-21 TTAG ATTTG C AACC AGAGAC AACTGGG CTCTACGGGT ATGGGCAATTAAATG AC AGCTCA 

I gkS 1-22 TTAGATTTGCAACCAGAGACAACTGGACTCTACGGATATGGACAATTAAATGACAGCTCA 

IgkSl-23 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGTCAATTAAATGACAGCTCA 

IgkSl - 2 4 TTAGATTTG C AACC AG AG AC AACTGGCCTCTACGG CTATGG CCAATTAAATG ACAGCTCA 

IgkC2 CTGG AC CTGCAG C C CG AG ACCACCGG CCTGTACGGCTACGGCC AG CTC AACGAC AG CAG C 

181 191 201 211 221 231 

IgkCl G AGG AGG AGGATG AAAT AGATGGTC C AGCTGG AC AAG C AG AACCGG ACAG AGC C C ATTAC 

IgkSl - 2 1 GAGG AGG AGG ATGAAAT AG ATGGG CCAG CTGGGC AAGC AG AAC CGGAC AGAG CCCATTAC 

IgkSl - 22 GAGG AGG AGG ATGAAATAG ATGG ACC AGCTGG ACAAG CAGAACCGG ACAG AGC C C ATT AC 

IgkSl-23 GAGG AGG AGGATG AAAT AGATGGTC CAGCTGGTC AAG C AGAAC CGG ACAGAG C CC ATT AC 

IgkSl- 24 GAGG AGG AGG ATG AAATAGATGGC C CAG CTGGC CAAGC AG AAC CGGAC AG AG CCCATTAC 

IgkC2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

IgkCl AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

IgkS 1 - 2 1 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

IgkSl- 22 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

IgkSl - 2 3 AATATTGTAAC (3TTTTGTTGC AAGTGTG ACTCTACG CTTCGGTTGTG CGTAC AAAGCACA 

IgkSl -24 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

IgkC2 AAC ATCGTG ACCTTCTGCTG C AAGTGCG AC AGCAC C CTG CG C CTCTG CGTGCAGAGC AC C 

301 311 321 331 341 351 

IgkC 1 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

IgkSl - 2 1 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGGACACTAGGGATTGTGTGCCCC 

IgkSl-22 C ACGT AG AC ATTCGT ACTTTGGAAG AC CTGTTAATGGG AAC ACT AGGAATTGTGTGC C C C 

IgkSl - 2 3 CACGTAG ACATTCGTACTTTGG AAGACCTG TTAATGGGTACACTAGGTATTGTGTG CC C C 

I gkS 1 - 2 4 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGCATTGTGTGCCCC 

IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTC AG AAGCC CTAAGAATT C 

IgkSl - 2 1 ATCTGCTCTCAGAAG CC CT AAG AATTC 

I gkS 1 - 2 2 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1 - 2 3 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkSl - 2 4 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkC2 ATCTG CTCCCAG AAG CCCTAAG AATTC 
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I gkC 1 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkS 1-25 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
I gkS 1 - 2 6 GGTACCGC CG CCAC C ATGGAGACAG ACACACTCCTG CTATGGGTACTG CTG CTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

IgkCl CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 
I gkS 1-25 CC AGGTTCC ACTGGTGACGG ATC C ATG C ATGG AGATACACCTAC ATTG C ATG AATATATG 

IgkS 1-26 CCAGGTTCCACTGGTGACGGATCCATGCACGGAGATACACCTACATTGCACGAATATATG 
IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

IgkCl TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkSl-25 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
I gkS 1 - 2 6 TTAG ATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGAC AGCTCA 

IgkC2 CTGGACCTG C AGC CCGAGACCAC CGG C CTGTACGGCTACGGCC AGCTC AACG AC AG C AG C 

181 191 201 211 221 231 

IgkCl G AGGAGGAGG ATGAAATAGATGGTCCAGCTGGACAAG C AGAAC CGGAC AGAGC C CATT AC 

IgkSl-25 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkSl-26 GAGGAGGAGG ATG AAATAGATGGT C GAG CTGGACAAG CAGAAC CGG AC AG AG C CC ACT AC 

IgkC2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

x gkC 1 AAT ATTGTAAC CTTTTGTTG C AAGTGTGACTCTACGCTTCGGTTGTG CGTAC AAAGCACA 

IgkSl-25 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkSl-26 AATATTG TAACCrTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTG CGTACAAAGCAC A 

Ig kC2 AAC ATCGTG ACCTTCTG CTGC AAGTGCG ACAGCAC C CTGCG C CTCTGCGTGCAG AG C ACC 

301 311 321 331 341 351 

IgkCl C ACGTAG ACATTCGTACTTTGG AAGACCTGTTAATGGG C ACACT AGGAATTGTGTGC C CC 

IgkSl-25 CATGTAG AC ATTCGT ACTTTGG AAG AC CTGTTAATGGGC AC ACTAGG AATTGTGTG C C C C 

IgkSl-26 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTCAG AAGCCCTAAG AATTC 

I gkS 1-25 ATCTG CTCTC AGAAG CC CTAAGAATTC 

I gkS 1 - 2 6 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkC2 ATCTGCTCCCAGAAG C C CTAAGAATTC 
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IgkCl GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
I gkS 1 - 2 7 GGT AC CG C CGCC ACCATGGAGACAG ACAC ACTC CTG CTATGGGTACTGCTGCTCTGGGTT 
IgkSl - 2 8 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkSl - 2 9 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkC2 GGTAC CGCCG C C AC C ATGGAG AC CGAC ACCCTC CTGCTGTGGGTG CTG CTG CTCTGGGTG 

61 71 81 91 101 111 

I gkC 1 C CAGGTTCC ACTGGTG ACGGATC CATGCATGG AG ATAC ACCTAC ATTG C ATGAATATATG 

I gkS 1 - 2 7 CC AGGTTC C ACTGGTGACGGATCC ATGC ATGG AGATACAC CT ACATTGCATGAATAT ATG 

I gkS 1-28 C CAGGTTCCACTGGTG ACGGATC CATGCATGGAG ATAC ACCTAC ATTGC ATGAATATATG 

IgkSl-29 CC AGGTTC C ACTGGTGACGGATCC ATGC ATGGAGATACAC CT ACATTGCATGAATAT ATG 

I gkC 2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

I gkC 1 TTAGATTTG C AAC C AGAGAC AACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTC A 

I gkS 1-27 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkSl - 28 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
I gkS 1 - 2 9 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkC2 CTGGAC CTGC AGC C CGAGAC C AC CGG CCTGTACGG CTACGGCC AGCTCAACG ACAGC AG C 

181 191 201 211 221 231 

IgkCl GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkSl - 2 7 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkS 1-28 GAGGAGGAGGATGAAATTGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkSl - 2 9 GAGGAGGAGGATGAAATCGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
IgkC2 GAGGAGGAGGACGAG AT CG ACGGCCC CGCCGG CC AGG C CG AGCCCG AC CGCG C CC ACTAC 

241 251 261 271 281 291 

IgkCl AATATTGT AAC CTTTTGTTGC AAGTGTGACTCTACG CTTCGGTTGTG CGT AC AAAGC AC A 

IgkSl - 2 7 AATATAGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
I gkS 1-28 AATATTGT AACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTG CGTAC AAAG C AC A 

IgkS 1 - 2 9 AATATCGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkC2 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

IgkCl C ACGTAGACATTCGT ACTTTGG AAGAC CTGTTAATGGGCACACTAGG AATTGTGTGCCC C 

IgkS 1 - 2 7 CACGTAGACATACGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATAGTGTGCCCC 
IgkSl - 2 8 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 
IgkSl-29 CACGTAGACATCCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATCGTGTGCCCC 
IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTCAGAAG CC CT AAG AATTC 

IgkSl- 27 ATATGCTCTCAGAAGCCCTAAGAATTC 
I gkS 1 - 2 8 ATTTGCTCTCAG AAG C C CTAAGAATTC 

I gkS 1-29 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkC2 ATCTGCTCCCAGAAGCCCTAAGAATTC 
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I gkS 1 - 5 0 GGTACCGCCGCCACCATGGAAACTGACACTCTGCTGCTGTGGGTACTGCTGCTGTGGGTT 

IgkSl-51 GGTAC CG C CGCCAC CATGG AAACTGAC ACTCTACT ACTATGGGTACTACTACTATGGGTT 

I gkS 1 - 5 2 GGTAC CGCCGCC^C CATGG AAACTG AC ACTCTTCTTCTTTGGGTACTTCTTCTTTGGGTT 

IgkSl - 5 3 GGTACCGCCGCCACCATGGAAACTGACACTCTCCTCCTCTGGGTACTCCTCCTCTGGGTT 

I gkS 1-54 GGTACCGCCGCCACCATGGAAACTGACACTTTGTTGTTGTGGGTATTGTTGTTGTGGGTT 

I gkS 1 - 5 5 GGTACCGCCGCCACCATGGAAACTGACACTTTATTATTATGGGTATTATTATTATGGGTT 

IgkC3 GGTAC CGCCGC C ACCATGG AGACAG ACACACTCCTGCTATGGGTACTGCTG CTCTGGGTT 

IgkC4 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

I gkS 1-50 C CAGG ATCGACTGGAG ACGG ATCCATGCATGGAG AC ACTC C AACTCTG C ATGAATAT ATG 

IgkS 1-51 CCAGGATCGACTGGAGACGGATCCATGCATGGAGACACTCCAACTCTACATGAATATATG 

IgkS 1-52 C CAGGATCG ACTGG AG ACGGATC CATGC ATGG AG AC ACTC C AACTCTTCATGAATAT ATG 

IgkS 1-53 C CAGGATCG ACTGG AGACGGATC C ATG CATGG AG AC ACTC CAACTCTCCATGAATATATG 

I gkS 1-54 CCAGGATCGACTGGAGACGGATCCATGCATGGAGACACTCCAACTTTGCATGAATATATG 

I gkS 1 - 5 5 CCAGGATCG ACTGG AG ACGGATCC ATG C ATGGAGACACTCCAACTTTACATG AATATATG 

I gkC 3 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATG AATATATG 

IgkC4 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

I gkS 1 - 5 0 CTGGACCTGCAACCGGAAACTACTGACCTGTACTGCTATGAACAACTGAATGACAGCTCG 

I gkS 1-51 CTAGACCTACAACCGGAAACTACTGACCTATACTGCTATGAACAACTAAATGACAGCTCG 

I gkS 1 - 5 2 CTTGAC CTTC AAC CGGAAACTACTG AC CTTTACTGCT ATGAAC AACTTAATGACAG CTCG 

I gkS 1 - 5 3 CTCGAC CTC CAAC CGGAAACTACTG AC CTCTACTGCTATGAAC AACTCAATG ACAG CTCG 

I gkS 1-54 TTGGACTTGCAACCGGAAACTACTGACTTGTACTGCTATGAACAATTGAATGACAGCTCG 

IgkSl- 55 TTAGACTTACAACCGGAAACTACTGACTTATACTGCTATGAACAATTAAATGACAGCTCG 

I g kC3 TTAG ATTTG C AACCAG AG ACAACTGATCTCTACTGTTATG AG CAATTAAATGACAGCTC A 

IgkC4 CTGGACCTG C AGC C CG AG ACC AC CGAC CTGTACTGCTACGAGCAG CTCAACGACAG C AG C 

181 191 201 211 221 231 

IgkSl- 50 G AAG AAGAAG ACGAAATAG ACGG ACCTGCAGGAC AAG C AG AAC GAG AC CGCGCACATT AC 

I gkS 1 - 5 1 G AAG AAG AAG ACG AAATAGACGG ACCTG C AGG AC AAGCAG AAC CAGAC CGCGCACATTAC 

IgkSl - 52 GAAGAAGAAGACGAAATAGACGGACCTGCAGGACAAGCAGAACCAGACCGCGCACATTAC 

I gkS 1 - 5 3 GAAG AAG AAGACG AAATAG ACGGACCTGCAGGACAAG C AG AAC CAG ACCG CG C AC ATT AC 

IgkSl- 54 G AAG AAGAAGACGAAATAG ACGGAC CTGCAGGAC AAG CAG AAC C AG ACCGCG C AC ATT AC 

IgkSl-55 GAAG AAGAAGACGAAATAGACGGAC CTGCAGGAC AAG C AG AACCAGAC CGCGCACATTAC 

IgkC3 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkC4 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

I gkS 1 - 5 0 AATATTGTAACTTTTTGCTGCAAGTGCGACAGTACTCTGCGCCTGTGCGTACAAAGCACT 

IgkS 1-51 AATATTGTAACTTTTTGCTGCAAGTGCGACAGTACTCTACGCCTATGCGTACAAAGCACT 

IgkS 1-52 AAT ATTGTAACTTTTTG CTG C AAGTGCGAC AGTACTCTTCGC CTTTGCGTAC AAAG C ACT 

I gkS 1 - 5 3 AATATTGTAACTTTTTGCTGCAAGTGCGACAGT ACTCTC CG C CTCTG CGTAC AAAGCACT 

I gkS 1 - 5 4 AATATTGTAACTTTTTGCTGCAAGTG CGAC AGTACTTTGCG CTTGTG CGT ACAAAGCACT 

IgkSl - 5 5 AATATTGTAACTTTTTGCTGCAAGTGCG AC AGTACTTT ACGCTTATG CGTACAAAG C ACT ' 

IgkC3 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

IgkC4 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

IgkSl - 5 0 CATGTAGACATTCGCACTCTGGAAGACCTGCTGATGGGAACTCTGGGAATTGTTTGCCCG 

I gkS 1-51 CATGTAGACATTCGCACTCTAGAAGACCTACTAATGGGAACTCTAGGAATTGTTTGCCCG 

IgkSl- 52 CATGTAGACATTCGCACTCTTGAAGACCTTCTTATGGGAACTCTTGGAATTGTTTGCCCG 

IgkSl- 53 CATGTAGACATTCGCACTCTCGAAGACCTCCTCATGGGAACTCTCGGAATTGTTTGCCCG 

IgkSl - 54 CATGTAG ACATTCG C ACTTTGG AAGACTTGTTG ATGGGAACTTTGGG AATTGTTTG C C CG 

I gkS 1-55 CATGTAGAC ATTCGCACTTTAGAAG ACTTATTAATGGG AACTTTAGGAATTGTTTG C C CG 

I gkC3 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

IgkC4 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkS 1-51 ATCTGCTCGCAAAAGCCTTAAGAATTC 

I gkS 1-53 ATCTGCTCGCAAAAGCCTTAAGAATTC 

I gkS 1 - 5 0 ATCTGCTCGCAAAAGCCTTAAGAATTC 

I gkS 1 - 5 2 ATCTGCTCGCAAAAGCCTTAAGAATTC 

IgkSl- 55 ATCTGCTCGCAAAAGCCTTAAGAATTC 

I gkS 1-54 ATCTGCTCGCAAAAGCCTTAAGAATTC 

I gkC 3 ATCTG CTCTCAGAAG CC CTAAG AATTC 

IgkC4 ATCTG CTCCC AG AAGCCCTAAGAATTC 
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IgkSl - 3 2 GGTACCG C CGCCACCATGGAG AC AGACAC ACTCCTG CTATGGGTACTGCTG CTCTGGGTT 

I gkS 1-33 GGTACCG C CG CC ACCATGGAG AC AG AC AC ACTCCTG CTATGGGTACTGCTG CTCTGGGTT 

I gkC 1 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

I gkS 1 - 3 2 CC AGGTTC CACTGGTGACGG ATC CATGC ATGGAG ATACACCTAC ATTGC ATG AATATATG 

IgkSl-33 C C AGGTTCC ACTGGTG ACGG ATCCATG C ATGGAG ATACAC CTAC ATTG C ATGAATAT ATG 

IgkCl C CAGGTTCCACTGGTGACGGATCCATGCATGG AGATACACCTACATTG CATGAAT ATATG 

IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

IgkSl - 3 2 TTAGATTTTCAACCAGAGACAACTGGTTTTTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkSl-33 TTAGATTTC C AACCAG AG ACAACTGGTTTCTACGGTTATGGG CAATT AAATGAC AGCTCA 

IgkCl TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
IgkC2 CTGGACCTGCAGCCCGAGACCACCGGCCTGTACGGCTACGGCCAGCTCAACGACAGCAGC 

181 191 201 211 221 231 

I gkS 1 - 3 2 G AGG AGG AGGATGAAATAGATGGTC CAGCTGG ACAAGCAGAACCGGAC AG AGC CCATTAC 

I gkS 1-33 GAGG AGG AGG ATG AAATAG ATGGTCCAG CTGG ACAAG CAGAAC CGG ACAG AGCCC ATTAC 

GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 
GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 



IgkCl 
IgkC2 



241 251 261 271 281 291 

IgkSl - 3 2 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkSl - 3 3 AATATTGTAACCTTCTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkCl AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkC2 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

IgkSl - 3 2 C ACGTAG AC ATTCGT ACTTTGG AAGACCTGTTAATGGGCACACT AGGAATTGTGTG C CC C 

IgkSl - 3 3 C ACGTAGAC ATTCGTACTTTGG AAG AC CTGTTAATGGG CACACTAGG AATTGTGTGCCC C 

I gkC 1 " CACGTAGACATTCGT ACTTTGG AAGACCTGTTAATGGGCACACTAGGAATTGTGTG C C C C 

I g kC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

I gkS 1-32 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkS 1-33 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkC2 ATCTGCTCCCAGAAGCCCTAAGAATTC 
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IgkSl - 5 6 GGTAC CGC CGC C ACCATGG AAACTG ACACTCTCCTG CTATGGGTACTGCTGCTCTGGGTT 

IgkSl - 5 7 GGTACCGCCGCCACCATGGAAACTGACACTCTCCTG CTATGGGTACTGCTGCTCTGGGTT 

IgkS 1-58 GGTACCGCCGCCACCATGGAAACTGACACTCTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkSl - 5 9 GGTACCGCCGCCACCATGGAAACTGACACTCTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkC3 GGTAC CGC CG C C AC CATGGAGAC AGACACACTCCTG CTATGGGTACTGCTGCTCTGGGTT 

IgkC4 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

IgkSl - 5 6 CCGGGATCGACTGGAGACGGATCCATGCATGGAGACACTCCGACTTTGCATGAATATATG 

IgkSl- 57 CC AGGATCGACTGGAG ACGG ATC C ATG C ATGG AG ACACTC CAA CTTT GCATG AATATATG 

IgkSl - 5 8 C CTGG ATCG ACTGG AGACGGATCC ATGCATGG AGAC ACTCCTACTTTG C ATG AATATATG 

IgkSl - 5 9 CCCGGATCGACTGGAGACGGATCCATGCATGGAGACACTCCCACTTTGCATGAATATATG 

IgkC3 C C AGGTTCCACTGGTGACGGATC C ATG C ATGG AGAT AC ACCTAC ATTG C ATG AATATATG 

IgkC4 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

CT CGACTTG C AAC CGG AAACT ACTGACCTCTACTG CTATG AAC AATTGAATGACAGCTCG 
CTCGACTTGCAACCAGAAACTACTGACCTCTACTGCTATGAACAATTGAATGACAGCTCG 
IgkSl-58 CTCGACTTGCAACCTGAAACTACTGACCTCTACTGCTATGAACAATTGAATGACAGCTCG 
IgkS 1-59 CTCGACTTG C AAC C CGAAACTACTGACCTCTACTG CTATG AACAATTGAATGACAGCTCG 

IgkC3 " TTAGATTTGCAACCAGAGACAACTGATCTCTACTGTTATGAGCAATTAAATGACAGCTCA 
IgkC4 CTGGACCTG C AGC C CG AGAC C ACCGACCTGT ACTG CT ACG AG CAG CTCAACGAC AGCAGC 

181 191 201 211 221 231 

IgkSl-56 GAAGAAGAAGACGAAATAGACGGACCGGCAGGACAAGCAGAACCGGACCGCGCACATTAC 



IgkSl- 56 
IgkSl-57 



IgkSl- 57 
IgkSl-58 



GAAGAAG AAG ACG AAATAG ACGG AC C AGC AGG AC AAG CAG AAC CAG AC CGCGC AC ATT AC 
G AAG AAGAAGACG AAATAGACGGACCTG C AGG AC AAG C AGAACCTG ACCG CG C AC ATTAC 
IakSl - 5 9 GAAGAAG AAG ACG AAATAG ACGG AC CCGCAGG AC AAGC AG AAC CCG AC CG CGC AC ATT AC 

IqkC3 G AGG AGGAGGATG AAATAGATGGTCC AGCTGG AC AAGC AGAAC CGG AC AG AG C C C ATTAC 

IgkC4 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

IqkSl - 5 6 AAT ATTGT AACTTTTTGCTGCAAGTG CGAC AGTACT CTC CGCTTGTG CGTAC AAAGC ACT 

IakSl-57 AATATTGTAACTTTTTGCTGCAAGTGCGACAGTACTCTCCGCTTGTGCGTACAAAGCACT 

IqkSl- 58 AATATTGTAACTTTTTGCTGCAAGTGCGACAGTACTCTCCGCTTGTGCGTACAAAGCACT 

IakSl - 5 9 AATATTGTAACTTTTTGCTGCAAGTGCGACAGTACTCTCCGCTTGTGCGTACAAAGCACT 

IqkC3 AAT ATTGT AACCTTTTGTTG C AAGTGTGACTCTACG CTTCGGTTGTG CGTAC AAAGC AC A 

IgkC4 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

lakSl-56 C ATGT AG ACATTCGC ACTTTGG AAGAC CTCCT C ATGGG AACTTTGGG AATTGTTTG C CCG 

IakSl - 5 7 CATGTAG ACATTCGC ACTTTGG AAG ACCTC CTC ATGGGAACTTTGGGAATTGTTTGCCC A 

IqkS 1 - 5 8 C ATGTAGACATTCGCACTTTGGAAGAC CTCCT CATGGG AACTTTGGG AATTGTTTG C C CT 

IakSl - 5 9 CATGTAG ACATTCGC ACTTTGG AAG ACCTCCTCATGGG AACTTTGGG AATTGTTTGCCCC 

IqkC3 C ACGT AG ACATTCGTACTTTGGAAGAC CTGTT AATGGG C AC ACTAGG AATTGTGTG C CC C 

IgkC4 C ACGTGG AC ATC CG C ACC CTGG AGG ACCTGCTG ATGGGC AC C CTGGG C ATCGTGTGCC CC 

361 371 381 

IgkS 1-56 ATCTG CTCGCAAAAG C CGTAAG AATTC 

I gkS 1 - 5 7 ATCTG CTCGCAAAAGCCATAAGAATTC 

IgkSl- 58 ATCTG CTCGCAAAAGC CTTAAGAATTC 

IgkS 1-59 ATCTGCTCGCAAAAGCCCT AAG AATTC 

IgkC3 ATCTG CTCTCAG AAGC CCTAAGAATTC 

IgkC4 ATCTGCTCCCAGAAGCCCTAAGAATTC 
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I gkS 1 - 3 4 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

I gkS 1-35 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkS 1-36 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkS 1-37 GGT ACCG C CGCC AC CATGG AG AC AGAC AC ACTC CTG CT ATGGGT ACTGCTG CTCTGGGTT 

I gkS 1-38 GGTAC CG C CGC C ACCATGGAG AC AGACACACT C CTG CT ATGGGT ACTGCTGCTCTGGGTT 

I gkS 1-39 GGTACCG C CGC C AC CATGGAG AC AGAC AC ACTCCTG CTATGGGTACTGCTG CTCTGGGTT 

I gkC 1 GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCT ATGGGT ACTGCTGCTCTGGGTT 

IgkC2 GGTAC CGC CGC CACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

I gkS 1 - 3 4 CC AGGTAGTACTGGTGACGGAAGTATGCATGG AGATAC AC CTAC ATTG C ATGAATATATG 

I gkS 1 - 3 5 CCAGGTAGCACTGGTGACGGAAGCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1 - 3 6 CCAGGTTCGACTGGTGACGGATCGATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1 - 3 7 CCAGGTTCAACTGGTGACGG ATCAATG C ATGGAGATACACCTACATTGCATG AATATATG 

IgkS 1-38 CCAGGTTCTACTGGTGACGGATCTATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1-39 C C AGGTTC C ACTGGTG ACGGATC C ATGCATGG AG ATAC ACCT AC ATTGCATGAATATATG 

I gkC 1 C C AGGTTC CACTGGTGACGGATCCATGCATGG AGATAC AC CTAC ATTGCATGAATATATG 

IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

IgkS 1-34 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGTAGT 

I gkS 1-35 TTAG ATTTG CAAC CAG AGAC AACTGGTCTCTACGGTTATGGG C AATTAAATG AC AGCAG C 

I gkSl - 3 6 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACTCGTCG 

I gkS 1 - 3 7 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGG CAATTAAATGACTCATCA 

I gkS 1 - 3 8 TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGG CAATTAAATGACTCTTCT 

IgkSl - 3 9 TT AGATTTG CAAC CAGAG AC AACTGGTCTCTACGGTTATGGG CAATTAAATG ACTCCTCC 

IgkCl TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

IgkC2 CTGGACCTGCAGCCCGAGACCACCGGCCTGTACGGCTACGGCCAGCTCAACGACAGCAGC 

181 191 201 211 221 231 

I gkS 1-34 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkS 1-35 G AGG AGG AGGATG AAATAG ATGGTCCAGCTGG AC AAG C AGAACCGG ACAG AG CC CATT AC 

IgkS 1-36 G AGG AGGAGG ATGAAATAGATGGTC CAGCTGG ACAAGC AG AAC CGGAC AGAGCC CATTAC 

I gkS 1 - 3 7 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkSl-38 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkSl - 3 9 G AGG AGG AGGATGAAATAG ATGGTCCAGCTGG ACAAG C AG AACCGGACAG AGCC CATTAC 

IgkCl GAGG AGGAGG ATGAAATAGATGGTC CAGCTGG AC AAGC AG AAC CGGACAGAGC C CATTAC 

Ig kC2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

I gkS 1-34 AATATTGTAACCTTTTGTTGCAAGTGTGACAGTACGCTTCGGTTGTGCGTACAAAGTACA 

IgkSl - 3 5 AATATTGTAAC CTTTTGTTGCAAGTGTGAC AGCACGCTTCGGTTGTG CGTACAAAG C AC A 

IgkSl - 3 6 AATATTGTAACCTTTTGTTGCAAGTGTGACTCGACGCTTCGGTTGTGCGTACAATCGACA 

IgkSl - 3 7 AAT ATTGTAACCTTTTGTTG C AAGTGTG ACTCAACG CTTCGGTTGTG CGTAC AATCAACA 

I gkS 1-38 AATATTGTAAC CTTTTGTTG C AAGTGTG ACTCTACG CTTCGGTTGTGCGTAC AATCT AC A 

I gkS 1-39 AATATTGTAACCTTTTGTTGCAAGTGTGACTCCACGCTTCGGTTGTGCGTACAATCCACA 

I gkc 1 AATATTGTAAC CTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAG C AC A 

IgkC2 AACATCGTG ACCTTCTGCTGCAAGTG CG ACAGCAC CCTG CGC CTCTG CGTG C AGAGCACC 

301 311 321 331 341 351 

I gkS 1-34 C ACGTAGACATTCGT ACTTTGGAAGACCTGTTAATGGG C AC ACTAGGAATTGTGTG CC C C 

IgkS 1 - 3 5 CACGTAG AC ATT CGTACTTTGG AAGACCTGTT AATGGGCACACTAGG AATTGTGTG C CC C 

I gkS 1 - 3 6 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

I gkS 1-37 CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

I gkS 1-38 C ACGTAGACATTCGT ACTTTGGAAGACCTGTTAATGGG C AC ACTAGGAATTGTGTG C C C C 

IgkSl - 3 9 CACGTAGAC ATTCGTACTTTGG AAGAC CTGTTAATGGGCAC ACTAGGAATTGTGTG C C C C 

I gkCl CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

I gkS 1 - 3 4 ATCTGC AGTC AG AAG CCCTAAG AATTC 

I gkS 1-35 ATCTG CAG C CAG AAGCC CTAAGAATTC 

I gkS 1-36 ATCTG CTCGCAGAAGC C CTAAGAATTC 

I gkS 1-37 ATCTGCTC AC AG AAG C CCT AAG AATTC 

I gkS 1 - 3 8 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1 - 3 9 ATCTG CTC C CAG AAG CC CTAAGAATTC 

IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkC2 ATCTGCTCCCAGAAGCCCTAAGAATTC 
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IgkCl GGTAC CG C CGCCACCATGGAGACAGAC ACACTCCTG CTATGGGTACTGCTG CTCTGGGTT 

IgkSl-40 GGTAC CGC CGCCACCATGGAGACGGACACG CTC CTGCTATGGGT ACTGCTGCTCTGGGTT 

IgkSl-4 1 GGT ACCGCCG C C AC CATGGAGAC AGACACACTC CTGCTATGGGTACTGCTGCTCTGGGTT 

I gkS 1-42 GGTACCGCCG C C AC CATGGAGACTGACACTCTC CTGCTATGGGTACTGCTGCTCTGGGTT 

IgkSl - 4 3 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTATGGGTACTGCTGCTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

I gkCl CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1 - 4 0 CCAGGTTCCACGGGTGACGGATCCATGCATGGAGATACGCCTACGTTGCATGAATATATG 

I gkS 1-41 CCAGGTTCCACAGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

I gkS 1 - 4 2 C C AGGTT CCACTGGTG AC GGATCCATGC ATGGAGATACTC CTACTTTG CATGAATATATG 

I gkS 1 - 4 3 C C AGGTTC CACCGGTGAC GGA TCCATGC ATGGAG ATAC C C CTACCTTG CATGAATATATG 

IgkC2 CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

IgkCl TTAG ATTTGCAACCAG AG AC AACTGGTCTCTACGGTTATGGG C AATTAAATGACAG CTC A 

I gkS 1-40 TTAGATTTGCAACCAGAGACGACGGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

I gkS 1 - 4 1 TTAGATTTGCAACCAGAGACAACAGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

I gkS 1 - 4 2 TTAGATTTGCAACCAGAGACTACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 

IgkSl - 4 3 TTAGATTTG C AACCAGAGAC C ACCGGTCTCT ACGGTTATGGG C AATTAAATGACAG CTCA 

IgkC2 CTGGACCTGC AG C C CGAGACCAC CGGCCTGTACGGCT ACGGCC AG CTCAACGACAGCAGC 

181 191 201 211 221 231 

IgkCl GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkS 1-40 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkS 1-41 G AGG AGG AGGATGAAATAGATGGTCCAGCTGGAC AAG CAGAAC CGGAC AGAGCC CATTAC 

IgkSl - 4 2 GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

IgkSl- 4 3 G AGGAGGAGG ATG AAATAGATGGTCCAGCTGG ACAAGCAGAACCGG ACAGAG C C CATTAC 

IgkC2 GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

IgkCl AATATTGT AAC CTTTTGTTG C AAGTGTG ACTCTACG CTTCGGTTGTG CGT AC AAAGC AC A 

IgkSl - 4 0 AATATTGTAACGTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACG 

IgkSl - 4 1 AATATTGTAACATTTTGTTGCAAGTGTGACTCTACACTTCGGTTGTGCGTACAAAGCACA 

IgkSl-42 AATATTGT AACTTTTTGTTGCAAGTGTGACTCTACTCTT CGGTTGTG CGT AC AAAGC ACT 

IgkSl - 4 3 AATATTGTAACCTTTTGTTG C AAGTGTGACTCT AC C CTTCGGTTGTG CGTACAAAG C ACC 

IgkC2 AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

IgkCl CACGTAGACATTCGTACTTTGGAAGAC CTGTT AATGGG C AC ACTAGGAATTGTGTG CCCC 

IgkSl - 4 0 CACGTAGACATTCGTACGTTGGAAGACCTGTTAATGGGCACGCTAGGAATTGTGTGCCCC 

I gkS 1-41 CACGTAGACATTCGTACATTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

I gkS 1 - 4 2 CACGTAGACATTCGTACTTTGGAAGAC CTGTTAATGGG C ACTCTAGGAATTGTGTG CCCC 

IgkSl-43 CACGTAGAC ATTCGT AC CTTGGAAGACCTGTTAATGGGCAC C CTAGGAATTGTGTGCCCC 

IgkC2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1-40 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1 - 4 1 ATCTGCTCTCAGAAGCCCTAAGAATTC 

I gkS 1-42 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkSl- 4 3 ATCTGCTCTCAGAAGCCCTAAGAATTC 

IgkC2 ATCTGCTCCCAGAAGCCCTAAGAATTC 
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IgkCl GGTAC CGCCG CC ACC ATGGAG AC AG ACAC ACTCCTG CTATGGGTACTGCTGCTCTGGGTT 

I gkS 1-44 GGTACCGCCGCCACCATGGAGACAGACACACTCCTG CTATGGGTACTGCTGCTCTGGGTT 

IgkSl-45 GGTACCGCCGCCACCATGGAGACAGACACACTCCTG CTATGGGTACTGCTGCTCTGGGTT 

IgkC2 GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

IgkCl CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 
I gkS 1 - 4 4 CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 
I gkS 1 - 4 5 C CAGGTTCCACTGGTGACGGATCCATGCATGGAGATACAC CTAC ATTGC ATG AATAC ATG 

IgkC2 C CCGGCTCCACCGGCGACGGATCCATGCACGGCGACACC CCCAC CCTG C ACGAGTAC ATG 

121 131 141 151 161 171 

I gkC 1 TTAGATTTG CAACC AGAG ACAACTGGTCTCTACGGTTATGGG CAATTAAATG ACAGCTC A 

I gkS 1-44 TTAGATTTG C AAC C AG AG AC AACTGGTCTCTATGGTT ATGGG C AATTAAATGACAGCTC A 

I gkS 1-45 TTAGATTTG C AAC C AG AG AC AACTGGTCTCTACGGTTACGGG CAATTAAATG ACAGCTC A 

IgkC2 CTGGACCTGCAGCCCGAGACCACCGGCCTGTACGGCTACGGCCAGCTCAACGACAGCAGC 

181 191 201 211 221 231 

I gkC 1 G AGGAGGAGGATGAAATAGATGGTC C AGCTGGAC AAGCAGAACCGGACAG AG CC CATTAC 

IgkS 1-44 G AGG AGGAGGATG AAATAGATGGTCCAGCTGGAC AAGCAG AAC CGG AC AG AG C CCATTAT 

I gkS 1 - 4 5 G AGG AGG AGG ATG AAATAGATGGTCCAGCTGGAC AAGCAG AAC CGG ACAG AG C C CATTAC 

IgkC2 GAGGAGG AGGACG AGATCGACGGCCC CGCCGG C C AGGC CGAG C CCG ACCG CG C CCACTAC 

241 251 261 271 281 291 

I gkC 1 AATATTGTAAC CTTTTGTTGC AAGTGTG ACTCTACG CTTCGGTTGTG CGT ACAAAG CAC A 

IgkS 1 - 4 4 AATATTGTAACCTTTTGTTGC AAGTGTG ACTCTACG CTTCGGTTGTGCGT ACAAAG CACA 

IgkS 1-45 AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 
IgkC2 AACATCGTGACCTTCTGCTGCAAGTG CGACAGCAC C CTG CGCCTCTGCGTG C AGAGC ACC 

301 311 321 331 341 351 

I gkC 1 CACGTAG AC ATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGG AATTGTGTGC C C C 

I gkS 1-44 CACGTAGAC ATTCGTACTTTGGAAG AC CTGTTAATGGG C AC ACTAGG AATTGTGTGCC C C 

I gkSl - 4 5 CACGTAGAC ATTCGTACTTTGGAAG AC CTGTTAATGGG CAC ACTAGG AATTGTGTG CCC C 

I gkC 2 CACGTGGACATCCGCACCCTGGAGGACCTGCTGATGGGCACCCTGGGCATCGTGTGCCCC 

361 371 381 

IgkCl ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkSl-44 ATCTGCTCTCAGAAGCCCTAAGAATTC 
I gkS 1-45 ATCTGCTCTCAGAAGCCCTAAGAATTC 
IgkC2 ATCTGCTC C CAG AAGCCCTAAGAATTC 
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GGTAC CG C CGCC ACCATGG AG AC AGAC ACACTC CTGCTATGGGTACTG CTGCTCTGGGTT 
GGTACCGC CG CCACCATGGAG AC AGAC ACACTC CTGCTATGGGTG CTG CTGCTCTGGGTG 
GGTAC CG C CG CC ACCATGG AG AC AGAC ACACTC CTG CTATGGGTACTG CTG CTCTGGGTA 
GGT AC CG C CG CCACCATGGAG AC AGAC ACACTC CTGCTATGGGTTCTG CTGCTCTGGGTT 
GGTACCGCCGCCACCATGGAGACAGACACACTCCTGCTATGGGTCCTGCTGCTCTGGGTC 
GGTACCGCCGCCACCATGGAGACCGACACCCTCCTGCTGTGGGTGCTGCTGCTCTGGGTG 

61 71 81 91 101 111 

C C AGGTTC CACTGGTG ACGGATCCATG C ATGGAG ATAC AC CTACATTG CATG AATATATG 

C C AGGTTC CACTGGTG ACGGATCCATGC ATGGAGATAC AC CTACATTG CATG AATATATG 

C CAGGTTCCACTGGTGACGGATCCATGCATGGAGATAC AC CTACATTG CATG AATATATG 

CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

CCAGGTTCCACTGGTGACGGATCCATGCATGGAGATACACCTACATTGCATGAATATATG 

CCCGGCTCCACCGGCGACGGATCCATGCACGGCGACACCCCCACCCTGCACGAGTACATG 

121 131 141 151 161 171 

TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
TTAGATTTG C AAC CAG AG AC AACTGGTCTCTACGGTTATGGGC AATTAAATG AC AGCTC A 
TTAGATTTGCAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
TTAGATTTG CAACCAGAGACAACTGGTCTCTACGGTTATGGGCAATTAAATGACAGCTCA 
TTAGATTTG CAACCAG AG AC AACTGGTCTCTACGGTTATGGG C AATTAAATG ACAG CTC A 
CTGGACCTGCAGCCCGAGACCACCGGCCTGTACGGCTACGGCCAGCTCAACGACAGCAGC 

181 191 201 211 221 231 

GAGG AGGAGGATG AAATAGATGGTC C AG CTGG ACAAG C AGAACCGG ACAG AGCCC ATTAC 

GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

GAGGAGGAGGATGAAATAGATGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTAC 

G AGG AGGAGG ATGAAATAG ATGGTC C AGCTGG AC AAGCAG AAC CGG AC AGAGC C CATTAC 

GAGG AGGAGGATG AAATAGATGGTCCAG CTGGAC AAGC AG AACCGG AC AGAG C C CATTAC 

GAGGAGGAGGACGAGATCGACGGCCCCGCCGGCCAGGCCGAGCCCGACCGCGCCCACTAC 

241 251 261 271 281 291 

AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

AATATTGTGACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTGCAAAGCACA 

AATATTGTAACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACA 

AATATTGTTACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTTCAAAGCACA 

AATATTGTCACCTTTTGTTGCAAGTGTGACTCTACGCTTCGGTTGTGCGTCCAAAGCACA 

AACATCGTGACCTTCTGCTGCAAGTGCGACAGCACCCTGCGCCTCTGCGTGCAGAGCACC 

301 311 321 331 341 351 

CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

CACGTGGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCCCC 

CACGTAGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTATGCCCC 

CACGTTGACATTCGTACTTTGGAAGACCTGTTAATGGGCACACTAGGAATTGTTTGCCCC 

CACGTCG AC ATTCGTACTTTGG AAGAC CTGTTAATGGGC ACACTAGGAATTGTCTGC C C C 

C ACGTGG ACATCCG CAC C CTGG AGGAC CTGCTGATGGG C AC C CTGGGC ATCGTGTG C C C C 

361 371 381 

ATCTG CTCTCAG AAGCCCTAAGAATTC 
ATCTGCTCTCAGAAGCCCTAAGAATTC 
ATCTG CTCTCAG AAGCCCTAAGAATTC 
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FIGURE 18 
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FIGURE 19 
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